Abstract

An ordered pair of semi-infinite binary sequences $(\eta, \xi)$ is said to be compatible if there is a way of removing a certain number (possibly infinite) of ones from $\eta$ and zeroes from $\xi$ which maps both sequences to the same semi-infinite sequence. This notion was introduced by Peter Winkler, who also posed the following question: with $\eta$ and $\xi$ being independent i.i.d. Bernoulli sequences with parameters $p'$ and $p$ respectively, does there exist $(p', p)$ such that the set of compatible pairs has positive measure? It is known that this does not happen for $p$ and $p'$ very close to 1/2. In the positive direction, we construct, for any $\epsilon > 0$, a deterministic binary sequence $\eta_\epsilon$ whose set of zeroes has Hausdorff dimension larger than $1 - \epsilon$, and such that $P_p\{\xi : (\eta_\epsilon, \xi) \text{ is compatible}\} > 0$ for $p$ small enough, where $P_p$ stands for the product Bernoulli measure with parameter $p$.

Keywords: compatibility of sequences, dependent percolation. 

1 Introduction 

Consider the set of all semi-infinite binary sequences $\Xi = \{0,1\}^{\mathbb{N}}$, with $\mathbb{N} = \{1, 2, \dots\}$. For each $i \in \mathbb{N}$ define the "annihilation" operators $A^0_i$ and $A^1_i : \Xi \to \Xi$, which act according to the following rules: if $\xi = (\xi_1, \xi_2, \dots)$ and $\xi_i = 0$, then

$$A^0_i(\xi)_j = \begin{cases} \xi_j, & j < i, \\ \xi_{j+1}, & j \ge i, \end{cases} \quad \text{and} \quad A^1_i(\xi) = \xi; \tag{1}$$

and, respectively, if $\xi_i = 1$, then

$$A^0_i(\xi) = \xi \quad \text{and} \quad A^1_i(\xi)_j = \begin{cases} \xi_j, & j < i, \\ \xi_{j+1}, & j \ge i. \end{cases} \tag{2}$$

In other words, for every $i \in \mathbb{N}$, the sequence $A^0_i(\xi)$, respectively $A^1_i(\xi)$, is obtained from the sequence $\xi$ by annihilating (deleting) the $i$-th digit if it is 0, respectively 1, and shifting all elements of $\xi$ which are to the right of the $i$-th position by one unit to the left. We consider $\Xi$ as a metric space with the usual product topology.

Definition 1. Let $\eta = (\eta_1, \eta_2, \dots)$ and $\xi = (\xi_1, \xi_2, \dots)$ be elements of $\Xi$. The pair $(\eta, \xi)$ is said to be compatible if there exist sequences $(\eta^{(k)})_{k \ge 1}$ and $(\xi^{(k)})_{k \ge 1}$ in $\Xi$ such that: each $\eta^{(k)}$ is obtained from $\eta$ by finitely many applications of operators $A^1_i$; each $\xi^{(k)}$ is obtained from $\xi$ by finitely many applications of operators $A^0_i$; as $k \to \infty$, both sequences converge in $\Xi$ to the same limit.

Informally speaking, the pair $(\eta, \xi)$ is compatible if by deleting some ones in the first sequence and deleting some zeroes in the second sequence one can make them equal.
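
As a finite illustration of the deletions just described, the sketch below applies the two annihilation rules to finite prefixes stored as Python lists. The helper name `annihilate` and the two sample words are assumptions made for this example only; the definition itself concerns semi-infinite sequences and limits.

```python
def annihilate(seq, i, digit):
    """Finite-prefix analogue of the annihilation operator with upper index
    `digit` and lower index `i` (1-indexed): delete the i-th entry if it
    equals `digit`, shifting later entries left; otherwise return a copy."""
    if seq[i - 1] == digit:
        return seq[:i - 1] + seq[i:]
    return list(seq)


# Hypothetical finite words: ones may be deleted from eta, zeroes from xi.
eta = [1, 0, 1, 1, 0]
xi = [0, 0, 1, 0, 0]

# Delete the 1st digit of eta (a one), then the 2nd digit of the result (a one).
eta_reduced = annihilate(annihilate(eta, 1, 1), 2, 1)
# Delete the 1st digit of xi (a zero), then the 4th digit of the result (a zero).
xi_reduced = annihilate(annihilate(xi, 1, 0), 4, 0)
```

Both reduced words coincide here, which is the finite counterpart of the common limit required in the definition.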

For $p \in [0, 1]$, let $P_p$ be the product probability measure on $\Xi = \{0, 1\}^{\mathbb{N}}$ such that $P_p(\xi_i = 1) = p$ for all $i \ge 1$. Motivated by scheduling problems, P. Winkler (see [3]) posed the following question:

Does there exist a pair $(p, p') \in (0, 1)^2$ such that

$$P_p \otimes P_{p'}\{(\eta, \xi) \in \Xi \times \Xi : (\eta, \xi) \text{ is compatible}\} > 0?$$

This question has been addressed in [4]. A simple form of Peierls argument (see [5]) shows that if $p$ and $p'$ are close to 1/2, then

$$P_p \otimes P_{p'}\{(\eta, \xi) \in \Xi \times \Xi : (\eta, \xi) \text{ is compatible}\} = 0.$$

In this context, it is then natural to introduce the following 

Definition 2. For $p \in [0, 1]$ we say that $\eta \in \Xi$ is $p$-compatible in $\Xi$, or $p$-compatible for short, if

$$P_p\{\xi \in \Xi : (\eta, \xi) \text{ is compatible}\} > 0. \tag{3}$$

For $\mathbf{1} := (1, 1, 1, \dots)$, the pair $(\mathbf{1}, \xi)$ is compatible as long as the sequence $\xi$ has infinitely many ones. Thus, for all $p > 0$,

$$P_p\{\xi \in \Xi : (\mathbf{1}, \xi) \text{ is compatible}\} = 1, \tag{4}$$

and if $\mathbf{1}$ is replaced by $\eta \in \Xi$ which has only finitely many zeroes, then for every $0 < p < 1$

$$P_p\{\xi \in \Xi : (\eta, \xi) \text{ is compatible}\} > 0. \tag{5}$$

If $\eta$ has infinitely many zeroes but the distance between consecutive zeroes increases fast enough, for instance exponentially fast with a rate that suitably depends on $p$, it is straightforward to see that (5) still holds. However it is a priori unclear whether there are deterministic binary sequences $\eta$ with a richer (and more complicated) set of zeroes $Z_\eta := \{i \ge 1 : \eta_i = 0\}$ which still can be $p$-compatible for some positive $p$. Our main result can be stated as follows:

Theorem 3. For every $\epsilon > 0$ there exist $0 < p_\epsilon < 1$ and a binary sequence $\eta \equiv \eta_\epsilon \in \Xi$ such that $Z_{\eta_\epsilon}$ is a discrete fractal in the sense of [2], with the Hausdorff dimension $d_H(Z_{\eta_\epsilon}) > 1 - \epsilon$, and such that

$$P_p\{\xi \in \Xi : (\eta_\epsilon, \xi) \text{ is compatible}\} > 0$$

for any $p < p_\epsilon$.

For the proof of Theorem 3 it is convenient to exploit a representation of compatibility of binary sequences in the language of dependent oriented two-dimensional percolation. In Section 2 we describe the percolation model, defining its configuration in terms of $(\eta, \xi)$, and give conditions on $\eta$ and $\xi$ which guarantee the existence of an infinite open path from the origin.

The key ingredient of the proof consists in showing that Bernoulli sequences $\xi$ with small density of ones can be suitably mapped, by appropriate grouping of ones and removing unwanted zeroes, into sequences that satisfy the conditions mentioned in the previous paragraph. This is done in Sections 3 and S] together with the explicit construction of a deterministic binary sequence $\eta$. The proof of Theorem 3 is completed in Section 21. Details of the grouping lemma are given in the Appendix.

2 Percolation process 

In this section we describe the related dependent percolation model. For this we 
first introduce an alternative representation of binary sequences which is useful 
for our purposes. 

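Consider the set $\Xi_\infty \subset \Xi$ of all binary sequences $\xi \in \Xi$ that contain infinitely many ones and infinitely many zeroes:

$$\Xi_\infty = \big\{\xi \in \{0,1\}^{\mathbb{N}} : |\{i : \xi_i = m\}| = \infty \text{ for } m = 0, 1\big\}.$$

Each $\xi \in \Xi_\infty$ can be represented as an element $f(\xi)$ of $\mathbb{Z}_+^{\mathbb{N}}$, where each run of ones in $\xi$ is replaced by a single coordinate whose value is the cardinality (length) of the run, with the corresponding shift to the left of the part of the sequence that follows the run. More precisely, define

$$\Psi = \{\psi \in \mathbb{Z}_+^{\mathbb{N}} : \psi_i \ge 1 \text{ implies } \psi_{i+1} = 0\}.$$

An element $\psi \in \Psi$ will be called a weighted word and the value $\psi_j \in \mathbb{N}$ will be called the weight of the $j$-th letter of $\psi$. Given $\xi \in \Xi_\infty$, the sequence $f(\xi)$ can be defined recursively:

$$(f(\xi))_1 = \begin{cases} k, & \text{if } \xi_s = 1 \text{ for } s = 1, \dots, k, \text{ and } \xi_{k+1} = 0, \\ 0, & \text{if } \xi_1 = 0. \end{cases} \tag{6}$$

If $(f(\xi))_i$, $1 \le i \le j - 1$, are defined, for $j \ge 2$, next we define

$$\sigma_{j-1}(\xi) = \sum_{i=1}^{j-1} \big((f(\xi))_i + \mathbb{I}_{[(f(\xi))_i = 0]}\big), \tag{7}$$

and we set

$$(f(\xi))_j = \begin{cases} k, & \text{if } \xi_s = 1 \text{ for } s = \sigma_{j-1}(\xi) + r, \ r = 1, \dots, k, \text{ and } \xi_{\sigma_{j-1}(\xi)+k+1} = 0, \\ 0, & \text{if } \xi_{\sigma_{j-1}(\xi)+1} = 0. \end{cases} \tag{8}$$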

The just defined map $f : \Xi_\infty \to \Psi$ is one-to-one, and so defines a bijection between $\Xi_\infty$ and the subset of $\Psi$ of weighted words which have infinitely many non-zero entries.
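
The run-length description of the map $f$ can be sketched on a finite prefix as follows. This is only an illustration under the reading that every zero of the binary sequence becomes a letter 0 and every completed run of ones becomes one letter whose weight is the run length; the function name `weighted_word` is hypothetical, and a trailing run of ones that is not yet closed by a zero is dropped because its final length is unknown from the prefix.

```python
def weighted_word(bits):
    """Encode a finite 0/1 prefix as a prefix of its weighted word.

    Each completed run of ones contributes one letter equal to the run
    length; each zero contributes the letter 0. An unfinished trailing
    run of ones is ignored.
    """
    word = []
    run = 0
    for b in bits:
        if b == 1:
            run += 1
        else:
            if run > 0:
                word.append(run)   # the run just closed by this zero
            word.append(0)         # the zero itself
            run = 0
    return word


example = weighted_word([1, 1, 0, 0, 1, 0])
```

In the output, a positive letter is always followed by 0, which is the defining constraint of a weighted word.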

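For each $i \in \mathbb{N}$ define the "annihilation" operators $A^0_i$ and $A^1_i : \Psi \to \Psi$, which act according to the following rules: if $\psi = (\psi_1, \psi_2, \dots) \in \Psi$ and $\psi_i \ge 1$, then $A^0_i(\psi) = \psi$; otherwise, when at least one among $\psi_{i-1}$ or $\psi_{i+1}$ is zero, or $i = 1$, then

$$A^0_i(\psi)_j = \begin{cases} \psi_j, & \text{if } j < i, \\ \psi_{j+1}, & \text{if } j \ge i. \end{cases} \tag{9}$$

When $i \ge 2$, $\psi_i = 0$ but $\psi_{i-1} \wedge \psi_{i+1} \ge 1$, then

$$A^0_i(\psi)_j = \begin{cases} \psi_{j+2}, & \text{if } j \ge i, \\ \psi_{i-1} + \psi_{i+1}, & \text{if } j = i - 1, \\ \psi_j, & \text{if } j < i - 1. \end{cases} \tag{10}$$

Similarly, if $\psi_i = 0$, then $A^1_i(\psi) = \psi$. Otherwise, when $\psi_i = 1$ we set

$$A^1_i(\psi)_j = \begin{cases} \psi_j, & \text{if } j < i, \\ \psi_{j+1}, & \text{if } j \ge i, \end{cases} \tag{11}$$

while when $\psi_i > 1$ we set

$$A^1_i(\psi)_j = \begin{cases} \psi_j - 1, & \text{if } j = i, \\ \psi_j, & \text{if } j \ne i. \end{cases} \tag{12}$$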

Definition 4. Let $\zeta = (\zeta_1, \zeta_2, \dots)$ and $\psi = (\psi_1, \psi_2, \dots)$ be elements of $\Psi$. The pair $(\zeta, \psi)$ is said to be compatible if there exist sequences $(\zeta^{(k)})_{k \ge 1}$ and $(\psi^{(k)})_{k \ge 1}$ in $\Psi$ such that: each $\zeta^{(k)}$ is obtained from $\zeta$ by finitely many applications of operators $A^1_i$; each $\psi^{(k)}$ is obtained from $\psi$ by finitely many applications of operators $A^0_i$; as $k \to \infty$, both sequences have a common limit in $\Psi$ (product topology).

The following proposition follows at once from the definitions.

Figure 1: Simulated samples of the open oriented cluster for a pair of geometric variables $(\zeta, \psi)$. Unusually large values along the $x$-axis create long vertical open segments, while large values along the $y$-axis produce drastic horizontal cuts.

Proposition 5. Let $(\eta, \xi)$ be a pair of configurations in $\Xi_\infty$, and let $\zeta = f(\eta)$, $\psi = f(\xi)$ with $f$ the map defined in (6)-(8). If the pair $(\zeta, \psi)$ is compatible, then so is $(\eta, \xi)$.

Percolation process. The percolation process will be defined on the oriented graph $G = (V, E)$, where $V = \mathbb{Z}_+^2$ and $E = \{(v, w) : w_1 = v_1 \text{ and } w_2 = v_2 + 1; \ w_1 = v_1 + 1 \text{ and } w_2 = v_2 + 1\}$, with $v = (v_1, v_2)$, $w = (w_1, w_2)$, i.e. the edges consist of the nearest neighbor vertical edges and the northeast oriented diagonals on the first quadrant of $\mathbb{Z}^2$; all the edges $(v, w)$ are oriented from $v$ to $w$.

Given two sequences $\zeta, \psi \in \Psi$ we define the north-east oriented site percolation configuration $\omega_{\zeta,\psi}$ on $G$ by setting: $\omega_{\zeta,\psi}(0, 0) = 1$; for $v = (v_1, v_2) \in V \setminus \{(0, 0)\}$ set $\omega_{\zeta,\psi}(v) = 0$ if $v_1 \wedge v_2 = 0$, while for $v_1, v_2 \ge 1$:

$$\omega_{\zeta,\psi}(v) = \begin{cases} 1 & \text{if } \zeta_{v_1} \ge \psi_{v_2}, \\ 0 & \text{otherwise.} \end{cases} \tag{13}$$

We say that $v$ is open if $\omega_{\zeta,\psi}(v) = 1$. An oriented path $\pi := \{u = v^{(0)}, e^{(1)}, v^{(1)}, \dots, e^{(n)}, v^{(n)} = v\}$ from $u$ to $v$ is said to be open if each vertex $v^{(i)}$ is open (here $e^{(i)} = (v^{(i-1)}, v^{(i)}) \in E$ for each $i$). We say that $v \in V$ belongs to the open oriented cluster of a vertex $u$ if there is an open oriented path from $u$ to $v$. The open oriented cluster of the origin is denoted by $C_{(\zeta,\psi)} = C_{(\zeta,\psi)}(0)$. More generally, given any finite subset $I \subset \mathbb{Z}_+^2$, by $C_{(\zeta,\psi)}(I)$ we denote the open oriented cluster of $I$, i.e. the union of the open oriented clusters of the vertices in $I$.
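
The open oriented cluster of the origin can be explored on a finite box with a simple graph search. The sketch below assumes that a vertex with positive coordinates is open exactly when $\zeta_{v_1} \ge \psi_{v_2}$, that the origin is the only open vertex on the axes, and that weighted words are given as finite Python lists (entry 0 of the list is the first letter). The function name `open_cluster` and the sample words are hypothetical, and the search does not check whether paths are permitted.

```python
def open_cluster(zeta, psi, height):
    """Vertices (v1, v2) with v2 <= height reachable from the origin through
    open vertices, moving by (0, +1) or (+1, +1) steps."""
    top = min(height, len(psi))

    def is_open(v1, v2):
        if v1 == 0 or v2 == 0:
            return (v1, v2) == (0, 0)          # only the origin is open on the axes
        return zeta[v1 - 1] >= psi[v2 - 1]     # assumed openness rule

    cluster = {(0, 0)}
    stack = [(0, 0)]
    while stack:
        v1, v2 = stack.pop()
        for w1, w2 in ((v1, v2 + 1), (v1 + 1, v2 + 1)):
            if w2 > top or w1 > len(zeta):
                continue                       # outside the finite box
            if (w1, w2) not in cluster and is_open(w1, w2):
                cluster.add((w1, w2))
                stack.append((w1, w2))
    return cluster


sample = open_cluster([0, 1, 0, 2], [0, 1, 0, 0], height=3)
```

Since every step raises the second coordinate by one, the search always terminates on a finite box.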

Definition 6. A vertex $v = (v_1, v_2) \in \mathbb{Z}_+^2$ is called "heavy" if $v_2 = 0$ or, when $v_2 \ge 1$, if $\psi_{v_2} > 0$.

Figure 2: Illustration of the open oriented cluster of the origin. Non-zero entries marked along the axes. Marked in bold a longest open permitted path.

Definition 7. Given two weighted words $\zeta, \psi \in \Psi$ and the associated percolation configuration $\omega_{\zeta,\psi}$, we say that an infinite oriented path $\pi$ starting from the origin is permitted if for any pair of heavy vertices $u = (u_1, u_2)$ and $v = (v_1, v_2)$ in $\pi$ we have $u_1 \ne v_1$.

Lemma 8. Let $\zeta, \psi \in \Psi$. If there exists an infinite open permitted path $\pi$ starting from the origin for the percolation configuration $\omega_{\zeta,\psi}$, then the pair $(\zeta, \psi)$ is compatible.

Proof. Assume that $\zeta$ and $\psi$ satisfy the conditions in the statement, and let $\pi$ be an infinite open permitted path starting from the origin. Let us first assume that $\psi$ contains infinitely many non-zero entries. In this case, we consider the increasing sequence of indices $\{\ell_j\}_{j \ge 1}$ that correspond to all non-zero entries of $\psi$, i.e., $\psi_i \ge 1$ if and only if $i = \ell_j$ for some $j$. In particular $\ell_{j+1} > \ell_j + 1$ for all $j \ge 1$. Let $\{v_{\ell_j} = (x_{\ell_j}, \ell_j)\}_{j \ge 1}$ be the corresponding sequence of heavy open vertices in $\pi \setminus \{(0,0)\}$, and set $\ell_0 = 0$, $x_0 = 0$, $v_0 = (0,0)$. Since $\pi$ is permitted, we have that $x_{\ell_j} < x_{\ell_{j'}}$ if $j < j'$. Since $v_{\ell_j}$ is open we also have that $\zeta_{x_{\ell_j}} \ge \psi_{\ell_j}$ for all $j \ge 1$. If $\pi'$ is an arbitrary oriented path on $G$, and $v = (v_1, v_2)$ and $u = (u_1, u_2)$ are on $\pi'$, then $|u_2 - v_2| \ge |u_1 - v_1|$. In particular for the open permitted path $\pi$ we have $\ell_j - \ell_{j-1} \ge x_{\ell_j} - x_{\ell_{j-1}} \ge 1$ for all $j \ge 1$.

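Denote by $j_{k,1} < \dots < j_{k,m_k}$ the indices of non-zero entries of $\zeta$ which lie strictly between $x_{\ell_{k-1}}$ and $x_{\ell_k}$. This set could be empty, in which case set $m_k = 0$. Notice that $x_{\ell_k} - \sum_{s=1}^{k} m_s$ is strictly increasing in $k$.

Define:

$$\zeta^{(1)} := \begin{cases} \Big([A^1_{j_{1,1}}]^{\zeta_{j_{1,1}}} \circ \dots \circ [A^1_{j_{1,m_1}}]^{\zeta_{j_{1,m_1}}}\Big) \circ [A^1_{x_{\ell_1}}]^{\zeta_{x_{\ell_1}} - \psi_{\ell_1}}(\zeta), & \text{if } m_1 \ge 1, \\ [A^1_{x_{\ell_1}}]^{\zeta_{x_{\ell_1}} - \psi_{\ell_1}}(\zeta), & \text{if } m_1 = 0. \end{cases}$$

The action of $[A^1_{x_{\ell_1}}]^{\zeta_{x_{\ell_1}} - \psi_{\ell_1}}$ on $\zeta$ decreases the value of $\zeta_{x_{\ell_1}}$ to $\psi_{\ell_1}$, and the action of $[A^1_{j_{1,1}}]^{\zeta_{j_{1,1}}} \circ \dots \circ [A^1_{j_{1,m_1}}]^{\zeta_{j_{1,m_1}}}$ deletes all non-zero entries of $\zeta$ which precede $\zeta_{x_{\ell_1}}$, i.e.

$$\zeta^{(1)}_j = \begin{cases} 0, & \text{if } j = 1, \dots, x_{\ell_1} - m_1 - 1, \\ \psi_{\ell_1}, & \text{if } j = x_{\ell_1} - m_1, \\ \zeta_{x_{\ell_1}+i}, & \text{if } j = x_{\ell_1} - m_1 + i, \ i = 1, 2, \dots \end{cases}$$

Now define

$$\psi^{(1)} := [A^0_1]^{\ell_1 - x_{\ell_1} + m_1}(\psi),$$

i.e.

$$\psi^{(1)}_j = \begin{cases} 0, & \text{if } j = 1, \dots, x_{\ell_1} - m_1 - 1, \\ \psi_{\ell_1}, & \text{if } j = x_{\ell_1} - m_1, \\ \psi_{\ell_1+i}, & \text{if } j = x_{\ell_1} - m_1 + i, \ i = 1, 2, \dots \end{cases}$$

Thus, $\zeta^{(1)}_j = \psi^{(1)}_j$ for $j = 1, \dots, x_{\ell_1} - m_1$. One should notice that $\psi^{(1)}$ and $\zeta^{(1)}$ are both elements of $\Psi$; for this we recall that $\zeta_{x_{\ell_1}} \ge \psi_{\ell_1} \ge 1$ and therefore $\zeta_{x_{\ell_1}+1} = 0$, since $\zeta \in \Psi$.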

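Assume that we have constructed $\zeta^{(k)}$ and $\psi^{(k)}$ in $\Psi$ which satisfy the following property:

$$\begin{aligned} \zeta^{(k)}_j &= \psi^{(k)}_j, && \text{if } j \le x_{\ell_k} - \sum_{s=1}^{k} m_s, \\ \zeta^{(k)}_j &= \zeta_{j + \sum_{s=1}^{k} m_s}, && \text{if } j > x_{\ell_k} - \sum_{s=1}^{k} m_s, \\ \psi^{(k)}_j &= \psi_{j + \sum_{s=1}^{k} m_s - x_{\ell_k} + \ell_k}, && \text{if } j > x_{\ell_k} - \sum_{s=1}^{k} m_s. \end{aligned} \tag{14}$$

Recalling that $x_{\ell_k} - \sum_{s=1}^{k} m_s$ is strictly increasing in $k$, we proceed as follows: writing $\Sigma_k := \sum_{s=1}^{k} m_s$, if $m_{k+1} \ge 1$, define

$$\zeta^{(k+1)} := \Big([A^1_{j_{k+1,1} - \Sigma_k}]^{\zeta_{j_{k+1,1}}} \circ \dots \circ [A^1_{j_{k+1,m_{k+1}} - \Sigma_k}]^{\zeta_{j_{k+1,m_{k+1}}}}\Big) \circ [A^1_{x_{\ell_{k+1}} - \Sigma_k}]^{\zeta_{x_{\ell_{k+1}}} - \psi_{\ell_{k+1}}}(\zeta^{(k)}),$$

while if $m_{k+1} = 0$, set

$$\zeta^{(k+1)} := [A^1_{x_{\ell_{k+1}} - \Sigma_k}]^{\zeta_{x_{\ell_{k+1}}} - \psi_{\ell_{k+1}}}(\zeta^{(k)}).$$

On the other hand, set

$$\psi^{(k+1)} := [A^0_{x_{\ell_k} - \Sigma_k + 1}]^{(\ell_{k+1} - \ell_k) - (x_{\ell_{k+1}} - x_{\ell_k}) + m_{k+1}}(\psi^{(k)}).$$

The action of the operator $[A^1_{x_{\ell_{k+1}} - \Sigma_k}]^{\zeta_{x_{\ell_{k+1}}} - \psi_{\ell_{k+1}}}$ on $\zeta^{(k)}$ decreases the value of $\zeta^{(k)}_{x_{\ell_{k+1}} - \Sigma_k}$ from $\zeta_{x_{\ell_{k+1}}}$ to $\psi_{\ell_{k+1}}$, and the action of the operator $[A^1_{j_{k+1,1} - \Sigma_k}]^{\zeta_{j_{k+1,1}}} \circ \dots \circ [A^1_{j_{k+1,m_{k+1}} - \Sigma_k}]^{\zeta_{j_{k+1,m_{k+1}}}}$ eliminates all non-zero entries of $\zeta^{(k)}$ which lie strictly between indices $x_{\ell_k} - \Sigma_k$ and $x_{\ell_{k+1}} - \Sigma_k$.

As in the case $k = 1$, we can check that both sequences $\zeta^{(k+1)}$ and $\psi^{(k+1)}$ are in $\Psi$ and satisfy the properties of (14) with $k$ replaced by $k + 1$. Proceeding recursively, we get that the following limits exist in $\Psi$:

$$\zeta^{(\infty)} := \lim_{k \to \infty} \zeta^{(k)}, \quad \text{and} \quad \psi^{(\infty)} := \lim_{k \to \infty} \psi^{(k)},$$

and $\zeta^{(\infty)} = \psi^{(\infty)}$, which implies compatibility of $\zeta$ and $\psi$.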

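When $\psi$ has only finitely many non-zero entries, $\psi^{(k+1)} = \psi^{(k)}$ for all $k$ large enough, after which one deletes the following ones in $\zeta$.

Given $k \in \mathbb{N}$ and $\psi \in \Psi$, let

$$i_k(\psi) = \begin{cases} \min\{n \in \mathbb{N} : \psi_n \ge k\}, & \text{if such } n < +\infty \text{ exists}, \\ +\infty, & \text{if } \psi_n < k \text{ for all } n \ge 1. \end{cases} \tag{15}$$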

Definition 9. Let $M \ge 2$ be an integer. A sequence $\psi \in \Psi$ is called $M$-spaced up to level $k$ if the following conditions are satisfied:

a) $j - i \ge M^{\min\{\psi_i, \psi_j\}}$, for all $1 \le i < j$, both in $\mathbb{N}$, (16)

b_k) $i_j(\psi) \ge M^j$, for all $j \le k$. (17)

A sequence $\psi \in \Psi$ is called $M$-spaced when a) holds and b_k) is replaced by b'):

b') $i_j(\psi) \ge M^j$, for all $j \ge 1$. (18)

We denote

$\Psi_M^{(k)} := \{\zeta \in \Psi : \zeta \text{ is } M\text{-spaced up to level } k\}$,
$\Psi_M := \{\zeta \in \Psi : \zeta \text{ is } M\text{-spaced}\}$.
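
For a finite prefix of a weighted word, the two conditions of Definition 9 can be checked directly. The sketch below is an illustration only: it assumes the inequalities in (16) and (17) are non-strict, treats $i_j(\psi)$ as $+\infty$ when no entry of the prefix reaches level $j$, and uses the hypothetical names `first_index_at_least` and `is_spaced_up_to_level`.

```python
import math


def first_index_at_least(psi, k):
    """Smallest 1-based index n with psi[n] >= k, or +inf if none in the prefix."""
    for n, weight in enumerate(psi, start=1):
        if weight >= k:
            return n
    return math.inf


def is_spaced_up_to_level(psi, M, k):
    """Check conditions a) and b_k) of M-spacedness on a finite prefix."""
    nonzero = [(n, w) for n, w in enumerate(psi, start=1) if w > 0]
    # a) pairs involving a zero entry are trivial, since M**0 = 1 <= j - i
    for a in range(len(nonzero)):
        for b in range(a + 1, len(nonzero)):
            (i, wi), (j, wj) = nonzero[a], nonzero[b]
            if j - i < M ** min(wi, wj):
                return False
    # b_k) the first entry of weight >= j must not appear before M**j
    return all(first_index_at_least(psi, j) >= M ** j for j in range(1, k + 1))


prefix = [0, 1, 0, 2, 0, 1, 0, 3]
spaced_for_2 = is_spaced_up_to_level(prefix, M=2, k=3)
spaced_for_3 = is_spaced_up_to_level(prefix, M=3, k=3)
```

The sample prefix passes the test for $M = 2$ and fails it for $M = 3$, because its entries of weight 1 and 2 at positions 2 and 4 are only two apart.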

Definition 10. Let $L \ge 2$ be an integer. Let $\zeta(L) \in \Psi_L$ be the sequence whose $j$-th entry, $j \ge 1$, is given by:

$$(\zeta(L))_j := \begin{cases} n, & \text{if } L^n \mid j \text{ and } L^{n+1} \nmid j, \\ 0, & \text{if } L \nmid j. \end{cases} \tag{19}$$

We now state the main result of this section.

Theorem 11. Let $L \ge 2$ and $M \ge 3(L + 1)$ be integers, $\psi \in \Psi_M$, and $\zeta(L)$ given by (19). Then the configuration $\omega_{\zeta(L),\psi}$ defined in (13) has an infinite open permitted path $\pi$ starting from the origin.
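
The deterministic sequence $\zeta(L)$ of Definition 10 is easy to tabulate. The sketch below assumes that its $j$-th entry is the largest $n$ such that $L^n$ divides $j$ (and 0 when $L$ does not divide $j$); the function name `zeta_L` is hypothetical.

```python
def zeta_L(j, L):
    """Largest n such that L**n divides j, for integers j >= 1 and L >= 2."""
    n = 0
    while j % L == 0:
        j //= L
        n += 1
    return n


# First eight entries for L = 2 and first nine entries for L = 3.
prefix_2 = [zeta_L(j, 2) for j in range(1, 9)]
prefix_3 = [zeta_L(j, 3) for j in range(1, 10)]
```

Entries of weight at least $n$ sit exactly at the multiples of $L^n$, so they are at least $L^n$ apart and the first one appears at position $L^n$.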

The proof of Theorem 11 will follow from a sequence of technical statements, the most important of which is given in Proposition 16, whose proof, in turn, relies on that of Proposition 17.

Remark 12. For the proof of Theorem 11 (see below) it is enough to treat the case when $i_k(\psi) < +\infty$ for all $k \ge 1$. For this reason, in the next statements we shall assume that all featuring $i_k(\psi)$ are finite.

For every $m \in \mathbb{N}$ and $\xi \in \Psi$, let $\theta_m \xi \in \Psi$ denote the shifted sequence given by $(\theta_m \xi)_j = \xi_{j+m}$, $j \ge 1$. Set $\theta_0 \xi = \xi$. From Definition 10 it follows that for every $m$ and $n \in \mathbb{N}$, the sequence $\theta_{mL^n} \zeta(L)$ satisfies, for all $j \ge 1$:

$$(\theta_{mL^n} \zeta(L))_j \begin{cases} = 0, & \text{if } L \nmid j, \\ = n', & \text{if } L^{n'} \mid j \text{ but } L^{n'+1} \nmid j, \text{ for } 1 \le n' < n, \\ \ge n, & \text{if } L^n \mid j. \end{cases} \tag{20}$$

Remark. The sequence $\theta_{mL^k} \zeta(L)$ is not in $\Psi_L$, since the inequality (18) is violated. However $\theta_{mL^k} \zeta(L) \in \Psi_L^{(k)}$ for any $m \in \mathbb{N}$.

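Proposition 13. If $\psi \in \Psi_M^{(k)}$ for some $k \in \mathbb{N}$, and $i \ge 1$ is such that $\psi_i = m \ge 1$, then $\theta_i \psi \in \Psi_M^{(m)}$. In particular, if $\psi \in \Psi_M$, then $\theta_i \psi \in \Psi_M^{(\psi_i)}$.

Proof. To verify inequality (16) for $\theta_i \psi$ observe that since $\psi \in \Psi_M^{(k)}$ and if $j > j'$, then

$$M^{\min\{(\theta_i \psi)_j, (\theta_i \psi)_{j'}\}} = M^{\min\{\psi_{i+j}, \psi_{i+j'}\}} \le j - j'.$$

On the other hand, if $j \ge 1$ is such that $(\theta_i \psi)_j \le m$, then we have:

$$M^{(\theta_i \psi)_j} = M^{\min\{\psi_{i+j}, \psi_i\}} \le i + j - i = j,$$

and thus $\theta_i \psi$ satisfies (17). □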

Notation. Since the parameter $L \ge 2$ is fixed, we will omit it from the notation when this causes no confusion.

Define the horizontal slab $\mathbb{S}_{[i,j]} := \{v = (v_1, v_2) \in \mathbb{Z}_+^2 : i \le v_2 \le j\}$.

Lemma 14. Let $\zeta = \zeta(L)$ be as in Definition 10. For any $k \in \mathbb{N}$, $m \in \mathbb{N}$ and $\psi \in \Psi$, if $i_k = i_k(\psi)$,

$$C_{(\theta_{mL^{\psi_{i_k}}} \zeta, \psi)} \cap \mathbb{S}_{[0,i_k]} = C_{(\zeta, \psi)} \cap \mathbb{S}_{[0,i_k]}, \tag{21}$$

in particular, for any $m, m' \in \mathbb{N}$

$$C_{(\theta_{mL^{\psi_{i_k}}} \zeta, \psi)} \cap \mathbb{S}_{[0,i_k]} = C_{(\theta_{m'L^{\psi_{i_k}}} \zeta, \psi)} \cap \mathbb{S}_{[0,i_k]}. \tag{22}$$

Proof. Since the origin is the unique open vertex along the axes, to prove (21) it suffices to consider vertices $v = (v_1, v_2)$ with $1 \le v_1$, $1 \le v_2 \le i_k$, and to show that $\zeta_{v_1} \ge \psi_{v_2}$ if and only if $\zeta_{v_1 + mL^{\psi_{i_k}}} \ge \psi_{v_2}$.

Let us first assume that $\zeta_{v_1} \ge \psi_{v_2}$. By the first two equalities of (20) we have that $\zeta_{v_1 + mL^{\psi_{i_k}}} = \zeta_{v_1}$ whenever $v_1$ is not a multiple of $L^{\psi_{i_k}}$. If $v_1$ is a multiple of $L^{\psi_{i_k}}$, then by the third inequality of (20) we have that $\zeta_{v_1 + mL^{\psi_{i_k}}} \ge \psi_{i_k}$. Now we use the fact that $v_2 \le i_k$, which implies that $\psi_{v_2} \le \psi_{i_k}$, by the definition of $i_k$. Thus we have $\zeta_{v_1 + mL^{\psi_{i_k}}} \ge \psi_{v_2}$ in both cases.

On the other hand, if $\zeta_{v_1} < \psi_{v_2}$ and $v_2 \le i_k$, we get $\zeta_{v_1} < \psi_{v_2} \le \psi_{i_k}$. This implies that $v_1$ cannot be a multiple of $L^{\psi_{i_k}}$. Thus, as just observed, $\zeta_{v_1 + mL^{\psi_{i_k}}} = \zeta_{v_1} < \psi_{v_2}$.

As already explained, this proves (21). Equality (22) follows immediately, concluding the proof of Lemma 14. □



The set $I$ is called a horizontal discrete segment if the elements of the set $I$ are vertices of $\mathbb{Z}_+^2$ whose first coordinates are consecutive integers and the second coordinate is the same for all elements. We denote by $l(I)$ and $r(I)$ respectively the value of the abscissa coordinate of the left and right endpoints of $I$.

For $\zeta, \psi \in \Psi$ we set:

$$V_{(\zeta,\psi)}(k) := \{v \in \mathbb{Z}_+ \times \{k\} : \exists \text{ open permitted path } \pi \text{ from the origin to } v\}. \tag{23}$$

Remark 15. Let $M \ge 2$ and $\psi \in \Psi_M^{(1)}$. Since $i_1(\psi) \ge M \ge 2$ and $\psi_i = 0$ for $i < i_1(\psi)$, all the vertices in $\{(v_1, v_2) : 1 \le v_1, v_2 \le i_1(\psi) - 1\}$ are open in the configuration $\omega_{\zeta,\psi}$, for any $\zeta \in \Psi$. It easily follows that

$$V_{(\zeta,\psi)}(i_1(\psi) - 1) = \{(j, i_1(\psi) - 1) ; \ j = 1, \dots, i_1(\psi) - 1\} \tag{24}$$

for all $\psi \in \Psi_M^{(1)}$, $\zeta \in \Psi$. (Indeed, any open oriented path connecting the origin to the set $\{(j, i_1(\psi) - 1) ; \ j = 1, \dots, i_1(\psi) - 1\}$ will also be permitted.)

The next proposition summarizes some basic properties of $V_{(\zeta,\psi)}(k)$ in the case of interest for Theorem 11.

Remark. Throughout we adopt the usual convention = for G K.) 

Proposition 16. Let $\zeta = \zeta(L)$ be as in Definition 10 and $\psi \in \Psi_M^{(k)}$, with $M \ge 3(L + 1)$ and $k \ge 1$. For any $m \in \mathbb{Z}_+$ the following holds:

i) $V_{(\theta_{mL^k} \zeta, \psi)}(i_k(\psi) - 1)$ is a discrete segment.
III) r(y(,^^,^,^)(*fc(^)-l)) >Zfc(V.)-max|^(L'- + l)["^^'^^"^' 



r=l 
Proof of Proposition 16 (beginning)

We shall proceed by induction. The statements are trivial in the case $k = 1$, where indeed equality holds, cf. Remark 15.

By induction hypothesis assume now that $V_{(\theta_{mL^k} \zeta, \psi)}(i_k(\psi) - 1)$ satisfies i), ii), and iii) for $k$, and we proceed to the case $k + 1$. For that we assume that $\psi \in \Psi_M^{(k+1)}$.

If $i_{k+1}(\psi) = i_k(\psi)$, then $V_{(\theta_{mL^{k+1}} \zeta, \psi)}(i_{k+1}(\psi) - 1) = V_{(\theta_{mL^{k+1}} \zeta, \psi)}(i_k(\psi) - 1)$ and, therefore, i), ii), and iii) are satisfied. We thus consider the case $i_{k+1}(\psi) > i_k(\psi)$. The last inequality implies that $\psi_{i_k(\psi)} = k$.
To proceed forward we will need the following proposition.

Proposition 17. Let $\zeta$ and $M$ be as in the statement of Proposition 16. Assume that i) - iii) of Proposition 16 hold for some $k \ge 1$ and that $\psi$ satisfies property (16) with $M \ge 3(L + 1)$. Let the index $j_1$ be such that $\psi_{j_1} = k$. Define $j_2 := j_1 + i_k(\theta_{j_1} \psi)$, i.e. $j_2$ is the smallest index $i > j_1$ such that $\psi_i \ge k$. Fix a discrete segment $I_1 \subset \mathbb{N} \times \{j_1 - 1\}$, such that $|I_1| \ge L^k$ (we use $|I|$ to denote the cardinality of a set $I$). Then the following holds: The set

$$I_2 := \{v \in \mathbb{N} \times \{j_2 - 1\} : \exists \text{ open permitted path } \pi \text{ from } I_1 \text{ to } v\}$$

is a discrete segment and

fc-1 



j2 - ji - 1 



M'' 
fc-1 

E( 

r=l 
fc-1 



^(/2) </(/l)+i'+5I(^' + l) 

r=l 

rih) > r{h) + {32 - Ji - 1) - L'^ - ^(L^ + 1) 
In particular we have 

Ihl > \h\ + {32 - 31 - 1) - 2L'^ - 2^(^' 



1) 



r=l 



32 - jl - 1 



32 -31 



(25) 

(26) 



(27) 



Proof of Proposition 17. Define

$$U_1 = \{u = (u_1, j_1) : (u_1, j_1 - 1) \in I_1 \text{ and } L^k \mid u_1\}.$$

Since $|I_1| \ge L^k$ we have

f 1, 



\Ui\> 



\Ii\-2L 



if L'= < < 
-J if|/i|>3i'=. 



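Enumerate the vertices of $U_1$ in increasing first coordinate order:

$$U_1 = \{u^1 = (u^1_1, j_1), \dots, u^{s_1} = (u^{s_1}_1, j_1)\}. \tag{28}$$

Since $\zeta_{u^r_1} \ge k$ for all $1 \le r \le s_1$, under the assumption that $\psi_{j_1} = k$ we have:

$$u^r = (u^r_1, j_1) \in C_{(\zeta,\psi)}(I_1) \cap (\mathbb{Z} \times \{j_1\}), \quad r = 1, \dots, s_1.$$

The same argument as in the proof of Lemma 13 applied to the pair of sequences $\theta_{u^r_1} \zeta$ and $\theta_{j_1} \psi$ gives

$$C_{(\theta_{u^r_1} \zeta, \theta_{j_1} \psi)} \cap \mathbb{S}_{[0, j_2 - j_1 - 1]} = C_{(\theta_{u^1_1} \zeta, \theta_{j_1} \psi)} \cap \mathbb{S}_{[0, j_2 - j_1 - 1]}, \quad \text{for } 2 \le r \le s_1, \tag{29}$$

i.e. each $C_{(\zeta,\psi)}(u^r) \cap \mathbb{S}_{[j_1, j_2 - 1]}$ is a shift of $C_{(\zeta,\psi)}(u^1) \cap \mathbb{S}_{[j_1, j_2 - 1]}$ by $(r - 1)L^k$ to the right.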

By Proposition 13 the sequence $\theta_{j_1} \psi$ satisfies the conditions of Proposition 16 (recall that $j_1$ is such that $\psi_{j_1} = k$). Thus, applying i) - iii) subsequently to pairs of sequences $\theta_{u^r_1} \zeta$ and $\theta_{j_1} \psi$, for $1 \le r \le s_1$, we have, thanks to validity of i) for $k$, that the set

$$V_{(\theta_{u^r_1} \zeta, \theta_{j_1} \psi)}(i_k(\theta_{j_1} \psi) - 1)$$

is a discrete segment for all $1 \le r \le s_1$. Next we argue that

$$I_2 = \bigcup_{r=1}^{s_1} \Big[V_{(\theta_{u^r_1} \zeta, \theta_{j_1} \psi)}(i_k(\theta_{j_1} \psi) - 1) + u^r\Big]. \tag{30}$$

Though the validity of the equality (30) is nearly obvious, it is not immediate. One should take care of the following: if there are two permitted paths $\pi$ and $\pi'$, from $x$ to $y$ and from $y$ to $z$, respectively, their concatenation at the vertex $y$ does not necessarily result in a permitted path from $x$ to $z$. However (recall Definitions 6 and 7) when $y$ is a heavy vertex, as is the case here for $y = u^r$, one automatically gets a permitted path.

Next we shall show that for all $2 \le r \le s_1$

r{V^e ,_,c,9n^>(*fc(^ii^)-l)+"''"') ^K^<«„K.«.,V)(*fc(^iiV')-l) + "0 (31) 
For this we need the following

Lemma 18. Let $L$ and $M \ge 3L$ be positive integers. Then for any $k \in \mathbb{N}$ and $a \ge M^k$, we have:

$$a \ge L^k + 2a \sum_{i=1}^{k-1} (L/M)^i. \tag{32}$$

Proof. A trivial computation shows that the r.h.s. of (32) is bounded from above by $L^k + a(1 - 3^{-(k-1)})$, and the inequality follows at once. □
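
The "trivial computation" in the proof of Lemma 18 can be spelled out as follows. This is a sketch under the assumption that (32) reads $a \ge L^k + 2a\sum_{i=1}^{k-1}(L/M)^i$ with $M \ge 3L$ and $a \ge M^k$; it uses only the geometric sum $\sum_{i=1}^{k-1} 3^{-i} = \tfrac12\big(1 - 3^{-(k-1)}\big)$.

```latex
\begin{align*}
2a\sum_{i=1}^{k-1}\Big(\frac{L}{M}\Big)^{i}
  &\le 2a\sum_{i=1}^{k-1} 3^{-i}
   = a\big(1 - 3^{-(k-1)}\big), \\
L^{k}
  &\le 3^{-k} M^{k}
   \le 3^{-k} a
   \le 3^{-(k-1)} a .
\end{align*}
```

Adding the two bounds gives $L^k + 2a\sum_{i=1}^{k-1}(L/M)^i \le 3^{-(k-1)}a + a - 3^{-(k-1)}a = a$.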
We return to the proof of (31). Applying ii) and iii) we have

k-l 

32- 



>j2-Jl-l-5](i^ + l) 
1=1 

>L'^ + ^(r' + i) 



A/' 



fc-i 



4=1 



j2 - jl ~ 1 
Mi 



>l{V(eM,^){^k{e,,^)-l) + u^), 



(33) 



for all $2 \le r \le s_1$, and where the second inequality is implied by (32) applied with $L + 1$ in place of $L$. Inequality (33) immediately implies that $I_2$ is a discrete segment. Next we will show (25) and (26).



< ^(^(fl„K,e,,v>(*fe(%V^) - 1) + ^(^i) + 

ikidjiip) - 1 



fe-i 



<;(/i) + L'^ + E(i* + i) 
1=1 
fc-i 

= /(/i) + L'^ + E(L' + l) 



J2 - ji - 1 



where in the third inequality of the above display we used ii). This proves (25). On the other hand

> <yK,^Ce,,^)i^k{eJ,i>) " 1)) +r(/i) ~ L'^ 



fe-i 



> r(/i) - L'^ + ife(0,,^) - 1 - E(L' + 1) 



j=i 
fc-i 



"(/i) + (j2-ji-i)-i'=-E(^* + i) 



j2 - jl - 1 



where in the third inequality of the above display we used ii). This proves (26) and finishes the proof of Proposition 17. □



Remark 19. Proposition 17 remains valid if we replace $\zeta$ by any shift $\theta_n \zeta$ of it, in particular by

Proof of Proposition 16 (continuation)

Now we return to the proof of i) - iii) for the case $k + 1$, in order to complete the induction step. Recall that we assume that $i_{k+1}(\psi) > i_k(\psi)$, which implies that $\psi_{i_k(\psi)} = k$, and we also assume that $\psi \in \Psi_M^{(k+1)}$.

Define iteratively:

$$j_1 = i_k(\psi), \qquad j_n = j_{n-1} + i_k(\theta_{j_{n-1}} \psi), \quad \text{for } n = 2, \dots, n^*,$$

where $n^* = \min\{n : \psi_{j_n} \ge k + 1\}$, that is, $j_{n^*} = i_{k+1}(\psi)$. Fix $m \ge 0$, and define

$$I_1 := V_{(\theta_{mL^{k+1}} \zeta, \psi)}(i_k(\psi) - 1),$$

$$I_n := \{x \in C_{(\theta_{mL^{k+1}} \zeta, \psi)}(I_{n-1}) \cap (\mathbb{Z} \times \{j_n - 1\}) : \exists \text{ permitted path from } I_{n-1} \text{ to } x\}$$

for $2 \le n \le n^*$.

First of all we notice that, as in the proof of Proposition 17,

$$I_{n^*} = V_{(\theta_{mL^{k+1}} \zeta, \psi)}(i_{k+1}(\psi) - 1).$$

Observe now that the sequence $\theta_{j_{n-1}} \psi$, for any $1 < n \le n^*$, satisfies the conditions of Proposition 17. We will apply it to each $I_n$ iteratively, starting from $I_1$.

i) - case k+1. Due to the first part of Proposition 17, we have that all $I_n$, $1 \le n \le n^*$, are discrete segments, and therefore $V_{(\theta_{mL^{k+1}} \zeta, \psi)}(i_{k+1}(\psi) - 1)$ is a discrete segment.

ii) - case k+1. 



/(/„.) ^ K^(e„..+iC,^)fe+i(V') - 1)) 



</(/„._i)+L^^ + ^(L'- + l) 



1* fc-1 



<l{h) + {n*-l)L^+Y,Y.iU^ + l) 

S- 

n* k~l 

<rf+^^(L'- + l 



js - is-i - 1 



s=2 r=l 



\ 



< 



< 



ik+i['^) - 1 
ik+iji') - 1 



k—1 n* . . - 

r— 1 s— 1 



fc-1 



r=l 



ifc-+i(V^) - 1 






which proves validity of ii) for the case $k + 1$.

iii) - case k+1.

r{ln') = r{y(0^^^+^i,^){ik+i{i^) - 1)) 

> r(/„._i) + On. - j„._i - 1) - L'' - Y,{U + 1) 



fe-i 



in' -I - 1 



fc-1 



> r{h) + J2ijs - js^i - 1) - n*L^ - Y.J2(V' + 1) 



js - is-i - 1 



n* k—1 



s—1 r—1 

in' - 1 



> Y^ijs - j.-i - 1) - - Y.Y.^L'- + 1) 
>j„. -n*-n*L'=-^(L'- + l) 

fc 

>2fc+i(^)-^(L'^ + l 



is - js-i - 1 



ife+i(-0) - 1 



Af 



J. 



which proves validity of iii) for the case $k + 1$, and finishes the proof of Proposition 16. □

Proof of Theorem 11

If $\psi \in \Psi_M$ is such that $i_k(\psi) < \infty$ for all $k$, the statement follows at once from Proposition 16. If $i_{k(1)}(\psi) < +\infty$ and $i_{k(1)+1}(\psi) = +\infty$, one repeats the argument of Proposition 16 for the sequence $\theta_{j_n}(\psi)$ for $j_1, j_2, \dots$ as in the proposition with $k = k(1)$; if this stops, then one moves down to the next $k(2) < k(1)$, and so on. □

The next result is a straightforward corollary of Theorem 11 and Lemma 8.

Corollary 20. If $\zeta(L)$ is given by (19) and $\psi \in \Psi_M$ with $M = 3(L + 1)$, then the pair $(\zeta(L), \psi)$ is compatible.


3 Grouping 

In this Section we show that if a binary sequence $\xi$ is sampled from $P_p$ with low density $p$ of ones, then with positive probability it can be viewed in a certain sense as an "$M$-spaced sequence". The precise statement is formulated in Corollary 23. This is obtained via an algorithm that suitably groups the ones in $\xi$ into clusters and attributes an adequate weight or mass to each cluster. The construction, with a hierarchical structure, was developed in [7] and is presented here for the sake of completeness. This section is mainly devoted to the description of this grouping procedure. The proofs of its convergence and further consequences needed here are taken from [7] and included in the Appendix.

Let $\xi \in \Xi$ be distributed according to $P_p$, where $p = P_p(\xi_i = 1)$ will be assumed small. Let $\Gamma \equiv \Gamma(\xi) = \{i \in \mathbb{N} : \xi_i = 1\}$. We shall decompose $\Gamma$ into sets $C_i$, called clusters, to which an $\mathbb{N}$-valued mass $m(C_i)$ is attributed ($m(C_i) \le |C_i|$) in a way that $d(C_i, C_j) \ge M^{\min\{m(C_i), m(C_j)\}}$, where $d(D_1, D_2)$ denotes the usual Euclidean distance between two sets $D_1$ and $D_2$, and $|C|$ stands for the cardinality of the set $C$.

The clusters $C_i := C_{\infty,i}$ will be obtained by a limiting recursive procedure. We will build an infinite sequence $\{\mathcal{C}_k\}_{k \ge 0}$ of partitions of $\Gamma$. Each partition $\mathcal{C}_k$ is a collection $\mathcal{C}_k = \{C_{k,j}\}_{j \ge 1}$ of subsets of $\Gamma$. The construction depends on the parameter $M$ ($M \ge 2$ a large integer to be fixed later according to the conditions of Lemma 21 below). The clusters will be constructed so as to have the properties

each $C_{k,j}$ is of the form $I \cap \Gamma$ for an interval $I$, (34)

and

$\mathrm{span}(C_{k,j}) \cap \mathrm{span}(C_{k,j'}) = \emptyset$ if $j \ne j'$, (35)

where $\mathrm{span}(C)$ is the smallest interval (in $\mathbb{Z}_+$) that contains $C$. To each cluster $C_{k,j}$ we will attribute a mass, $m(C_{k,j})$, in such a way that

$d(C_{k,j}, C_{k,j'}) \ge M^r$, if $\min\{m(C_{k,j}), m(C_{k,j'})\} \ge r$, for $r = 1, \dots, k$ and $j \ne j'$. (36)

To each cluster $C_{k,j}$ we shall further associate a number $\ell(C_{k,j}) \in \{0, 1, \dots, k\}$ which will be called the level of the cluster. This level will satisfy

$0 \le \ell(C_{k,j}) < m(C_{k,j})$. (37)

Finally we construct the (limiting) partition $\mathcal{C}_\infty = \{C_{\infty,j}\}_{j \ge 1}$ of $\Gamma$:

$\mathrm{span}(C_{\infty,j}) \cap \mathrm{span}(C_{\infty,j'}) = \emptyset$ if $j \ne j'$. (38)

To each cluster $C_{\infty,j}$ of $\mathcal{C}_\infty$ we will attribute a mass $m(C_{\infty,j})$, and a level $\ell(C_{\infty,j})$, in such a way that

$0 \le \ell(C_{\infty,j}) < m(C_{\infty,j})$, (39)

and the following property holds:

$d(C_{\infty,j}, C_{\infty,j'}) \ge M^r$, if $\min\{m(C_{\infty,j}), m(C_{\infty,j'})\} \ge r$, for $r \ge 1$ and $j \ne j'$, (40)

or equivalently,

$d(C_{\infty,j}, C_{\infty,j'}) \ge M^{\min\{m(C_{\infty,j}), m(C_{\infty,j'})\}}$, for $j \ne j'$. (41)

The construction. Let the elements of $\Gamma$ be labeled in increasing order: $\Gamma = \{x_j\}_{j \ge 1}$ with $x_1 < x_2 < \dots$

Level 0. The clusters of level 0 are just the subsets of $\Gamma$ of cardinality one. We take $C_{0,j} = \{x_j\}$ and attribute a unit mass to each such cluster. That is, $m(C_{0,j}) = 1$ and $\ell(C_{0,j}) = 0$. Set $\mathcal{C}_{0,0} = \mathcal{C}_0 = \{C_{0,j}\}_{j \ge 1}$. Further define $\alpha(C) = \omega(C) = x$ when $C = \{x\}$ is a cluster of level 0.

Level 1. We say that $x_i, x_{i+1}, \dots, x_{i+n-1}$ form a maximal 1-run of length $n \ge 2$ if

$$x_{j+1} - x_j < M, \quad j = i, \dots, i + n - 2,$$

and

$$x_{j+1} - x_j \ge M \quad \begin{cases} \text{for } j = i - 1, \ j = i + n - 1, & \text{if } i > 1, \\ \text{for } j = i + n - 1, & \text{if } i = 1. \end{cases}$$

The level 0 clusters $\{x_i\}, \{x_{i+1}\}, \dots, \{x_{i+n-1}\}$ will be called constituents of the run. Note that there are no points in $\Gamma$ between two consecutive points of a maximal 1-run. Also note that if $x_{j+1} - x_j \ge M$ and $x_j - x_{j-1} \ge M$, then $x_j$ does not appear in any maximal 1-run of length at least 2.

For any pair of distinct maximal runs, $r'$ and $r''$ say, all clusters in $r'$ lie to the left of all clusters in $r''$ or vice versa. It therefore makes sense to label the consecutive maximal 1-runs of length at least 2 in increasing order of appearance: $r^1_1, r^1_2, \dots$. It is immediate that $P_p$-a.s. all runs are finite, and that infinitely many such runs exist. We write $r^1_i = r^1_i(x_{s_i}, x_{s_i+1}, \dots, x_{s_i+n_i-1})$, $i = 1, 2, \dots$, if the $i$-th run consists of $x_{s_i}, x_{s_i+1}, \dots, x_{s_i+n_i-1}$. Note that $n_i \ge 2$ and $s_i + n_i \le s_{i+1}$ for each $i$. The set

$$C^1_i = \{x_{s_i}, x_{s_i+1}, \dots, x_{s_i+n_i-1}\}$$

is called a level 1-cluster, i.e., $\ell(C^1_i) = 1$. We attribute to $C^1_i$ the mass given by its cardinality:

$$m(C^1_i) = n_i.$$

The points

$$\alpha^1_i = x_{s_i} \quad \text{and} \quad \omega^1_i = x_{s_i+n_i-1}$$

are called, respectively, the start-point and end-point of the run, as well as of the cluster $C^1_i$. To avoid confusion we sometimes write more explicitly $\alpha(C^1_i)$, $\omega(C^1_i)$.

By $\mathcal{C}_{1,1}$ we denote the set of clusters of level 1. Let $\mathcal{C}'_{0,1} = \{\{x_i\} : x_i \in \Gamma \setminus C \text{ for all } C \in \mathcal{C}_{1,1}\}$ and $\mathcal{C}_1 = \mathcal{C}_{1,1} \cup \mathcal{C}'_{0,1}$. Note that $\mathcal{C}_{1,1}$ and $\mathcal{C}'_{0,1}$ consist of level 1 and level 0 clusters, respectively, and that the union of all points in these clusters is exactly $\Gamma$. We label the elements of $\mathcal{C}_1$ in increasing order as $C_{1,j}$, $j \ge 1$. For later use we also define $\mathcal{C}_{0,1} = \mathcal{C}_0$.

Notation. In our notation $C^1_j$ denotes the $j$-th level 1-cluster, and $C_{1,j}$ denotes the $j$-th element in $\mathcal{C}_1$ (always in increasing order).

Level k+1. Let $k \ge 1$ and assume that the partitions $\mathcal{C}_{k'} = \{C_{k',j} : j \ge 1\}$, and the masses of the $C_{k',j}$, have already been defined for $k' \le k$, and satisfy the properties (34) - (36), and that $\mathcal{C}_k$ consists of clusters $C$ of levels $\ell \in \{0, 1, \dots, k\}$. I.e.,

$$\mathcal{C}_k \subset \bigcup_{\ell=0}^{k} \mathcal{C}_{\ell,\ell}, \tag{42}$$

where for $\ell \ge 0$, $\mathcal{C}_{\ell,\ell}$ is the set of level $\ell$ clusters. We assume, as before, that the labeling goes in increasing order of appearance. Define

$$\mathcal{C}_{k,k+1} = \{C \in \mathcal{C}_k : m(C) \ge k + 1\}. \tag{43}$$

Notice that $\mathcal{C}_{1,2} = \mathcal{C}_{1,1}$ and $\mathcal{C}_{k,k+1} \subseteq \bigcup_{\ell=1}^{k} \mathcal{C}_{\ell,\ell}$, if $k \ge 1$.

In the previous enumeration of $\mathcal{C}_k$, let $j_1 < j_2 < \dots$ be the labels of the clusters in $\mathcal{C}_{k,k+1}$, so that $\mathcal{C}_{k,k+1} = \{C_{k,j_1}, C_{k,j_2}, \dots\}$. In $\mathcal{C}_{k,k+1}$ we consider consecutive maximal $(k+1)$-runs, where we say that the clusters $C_{k,j_s}, C_{k,j_{s+1}}, \dots, C_{k,j_{s+n-1}} \in \mathcal{C}_{k,k+1}$ form a maximal $(k+1)$-run of length $n \ge 2$ if:

$$d(C_{k,j_i}, C_{k,j_{i+1}}) < M^{k+1}, \quad i = s, \dots, s + n - 2,$$

and in addition

$$d(C_{k,j_i}, C_{k,j_{i+1}}) \ge M^{k+1} \quad \begin{cases} \text{for } i = s - 1, \ i = s + n - 1, & \text{if } s > 1, \\ \text{for } i = s + n - 1, & \text{if } s = 1. \end{cases}$$

Again it is immediate that $P_p$-a.s. all $(k+1)$-runs are finite and that infinitely many such runs exist. Again we can label them in increasing order and write $r^{k+1}_i = r^{k+1}_i(C_{k,j_{s_i}}, C_{k,j_{s_i+1}}, \dots, C_{k,j_{s_i+n_i-1}})$ for the $i$-th $(k+1)$-run, for suitable $s_i, n_i$ such that $n_i \ge 2$ and $s_i + n_i \le s_{i+1}$ for all $i$. ($s_i, n_i$ have nothing to do with those in the previous steps of the construction.) We set

$$\alpha^{k+1}_i = \alpha(C_{k,j_{s_i}}) \quad \text{and} \quad \omega^{k+1}_i = \omega(C_{k,j_{s_i+n_i-1}}),$$

and call these the start-point and end-point of the run, respectively. We define the span of the run

$$\mathrm{span}(r^{k+1}_i) = [\alpha^{k+1}_i, \omega^{k+1}_i],$$

and associate to it a cluster $C^{k+1}_i$ of level $k + 1$, defined as

$$C^{k+1}_i = \mathrm{span}(r^{k+1}_i) \cap \Gamma.$$

It is made up from the clusters $C_{k,j_{s_i}}, C_{k,j_{s_i+1}}, \dots, C_{k,j_{s_i+n_i-1}}$. In this case, the clusters $C_{k,j_{s_i}}, C_{k,j_{s_i+1}}, \dots, C_{k,j_{s_i+n_i-1}}$ are called constituents of $C^{k+1}_i$. To the cluster $C^{k+1}_i$ we attribute the mass $m(C^{k+1}_i)$ by the following rule:

$$m(C^{k+1}_i) = m(C_{k,j_{s_i}}) + \sum_{s=s_i+1}^{s_i+n_i-1} \big(m(C_{k,j_s}) - k\big) = \sum_{s=s_i}^{s_i+n_i-1} m(C_{k,j_s}) - k(n_i - 1). \tag{44}$$

The points $\alpha^{k+1}_i$ and $\omega^{k+1}_i$ will also be called, respectively, start- and end-point of the cluster $C^{k+1}_i$, and are also written as $\alpha(C^{k+1}_i)$ and $\omega(C^{k+1}_i)$.

By $\mathcal{C}_{k+1,k+1}$ we denote the set of all level $(k+1)$ clusters. Take $\mathcal{C}'_{k,k+1} = \{C \in \mathcal{C}_k : C \cap \mathrm{span}(r_i^{k+1}) = \emptyset,\ i = 1, 2, \dots\}$. Finally we define $\mathcal{C}_{k+1} = \mathcal{C}_{k+1,k+1} \cup \mathcal{C}'_{k,k+1}$. We label the elements of $\mathcal{C}_{k+1}$ as $C_{k+1,j}$, $j \ge 1$, in increasing order. Note that a cluster in $\mathcal{C}_k$ is also a cluster in $\mathcal{C}_{k+1}$ if and only if it is disjoint from the span of each maximal $(k+1)$-run of length at least 2. Thus $\mathcal{C}_{k+1}$ may contain some clusters of level no more than $k$, but some clusters (of level $\le k$) in $\mathcal{C}_k$ no longer appear in $\mathcal{C}_{k+1}$ (or any $\mathcal{C}_{k+j}$ with $j \ge 1$).

Note also that in the formation of a cluster of level $(k+1)$, clusters of mass at most $k$ might be incorporated while taking the span of a $(k+1)$-run; they form what we call dust (of level at most $k-1$) in between the constituents, which have mass at least $k+1$.
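The step from $\mathcal{C}_k$ to $\mathcal{C}_{k+1}$ can be sketched in code. The following Python fragment is only an illustration under stated assumptions, not part of the construction itself: a cluster is recorded by its start-point, end-point, mass and level (the names `Cluster` and `next_level` are hypothetical), the distance between two consecutive clusters is taken to be the gap between the end-point of the left one and the start-point of the right one, and the first step ($k = 0$, starting from singletons of mass 1) is assumed to follow the same rule.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    alpha: int   # start-point
    omega: int   # end-point
    mass: int
    level: int


def next_level(clusters, k, M):
    """One step C_k -> C_{k+1}; `clusters` must be sorted by start-point."""
    # C_{k,k+1}: clusters of mass at least k+1, see (43).
    heavy = [c for c in clusters if c.mass >= k + 1]

    # Group consecutive heavy clusters whose gap is < M^(k+1) into maximal runs.
    runs, current = [], []
    for c in heavy:
        if current and c.alpha - current[-1].omega < M ** (k + 1):
            current.append(c)
        else:
            if len(current) >= 2:
                runs.append(current)
            current = [c]
    if len(current) >= 2:
        runs.append(current)

    # Each run of length >= 2 becomes one level-(k+1) cluster, mass rule (44).
    merged = [
        Cluster(run[0].alpha, run[-1].omega,
                sum(c.mass for c in run) - k * (len(run) - 1), k + 1)
        for run in runs
    ]

    # Clusters lying inside the span of a run (constituents and dust) disappear.
    def inside_some_span(c):
        return any(m.alpha <= c.alpha and c.omega <= m.omega for m in merged)

    survivors = [c for c in clusters if not inside_some_span(c)]
    return sorted(survivors + merged, key=lambda c: c.alpha)


level0 = [Cluster(x, x, 1, 0) for x in (1, 2, 5, 8, 9)]
level1 = next_level(level0, k=0, M=3)
level2 = next_level(level1, k=1, M=3)
```

With $M = 3$ and points 1, 2, 5, 8, 9, the first step produces the clusters with spans $[1,2]$ and $[8,9]$ (mass 2 each) and leaves the singleton $\{5\}$; the second step merges them into one cluster with span $[1,9]$ and mass $2 + 2 - 1 = 3$, the point 5 being absorbed as dust.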

This describes the construction of the $\mathcal{C}_k$. We next show by induction that

$$\mathcal{C}_k \text{ is a partition of } \Gamma \text{ and } \mathcal{C}_k \text{ is a refinement of } \mathcal{C}_{k+1} \tag{45}$$

for $k \ge 0$. This is clear for $k = 0$, since $\mathcal{C}_0$ is the partition of $\Gamma$ into singletons. If we already know (45) for $0 \le k \le K$, then it follows also for $k = K+1$ from the fact that clusters in $\mathcal{C}_{k+1}$ are formed from the clusters in $\mathcal{C}_k$ by combining the consecutive clusters between the start- and end-point of a maximal $(k+1)$-run into one cluster. Thus it takes a number of successive clusters in $\mathcal{C}_k$ and combines them into one cluster. This establishes (45) for all $k$.

The definition of $\mathcal{C}_k$ shows that

$$\mathcal{C}_{k,k} \subset \mathcal{C}_k \subset \mathcal{C}_{k,k} \cup \mathcal{C}'_{k-1,k} \subset \mathcal{C}_{k,k} \cup \mathcal{C}_{k-1},$$

from which we obtain by induction that (42) holds, as well as

$$\ell(C) = \min\{k : C \in \mathcal{C}_k\}$$

for any $C \in \bigcup_{k \ge 1} \mathcal{C}_k$.

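We use induction once more to show that for any $k \ge 0$,

$$m(C) \ge \ell(C) + 1 \quad\text{for any } C \in \bigcup_{0 \le \ell \le k} \mathcal{C}_\ell, \tag{46}$$

and if $C_i^{k+1}$ is formed from the constituents $C_{k,j_{s_i}}, \dots, C_{k,j_{s_i+n_i-1}}$ with $n_i \ge 2$, then

$$m(C_i^{k+1}) \ge \max_s m(C_{k,j_s}) + n_i - 1 > \max_s m(C_{k,j_s}). \tag{47}$$

Indeed, (46) trivially holds for $k = 0$. Moreover, if (46) holds for $k \le K$, then (47) for $k = K$ follows from the rule (44) (and $n_i \ge 2$): each constituent has mass at least $K+1$, so the $n_i - 1$ constituents other than the heaviest one contribute at least $(n_i-1)(K+1) - K(n_i-1) = n_i - 1$ in (44). In turn, (47) and (46) for $k \le K$ imply

$$m(C_i^{k+1}) \ge \max_s m(C_{k,j_s}) + 1, \tag{48}$$

and hence also (46) for $k = K+1$.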
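As a sanity check on the mass rule (44) and on inequality (47), the following Python sketch computes the mass in both forms of (44) and verifies (47) by brute force over small constituent masses. The function names are hypothetical and introduced only for illustration; the check assumes, as in (43), that every constituent of a $(k+1)$-run has mass at least $k+1$.

```python
from itertools import product


def merged_mass(masses, k):
    """Mass of a level-(k+1) cluster with the given constituent masses, rule (44)."""
    incremental = masses[0] + sum(m - k for m in masses[1:])
    closed_form = sum(masses) - k * (len(masses) - 1)
    assert incremental == closed_form  # the two expressions in (44) agree
    return closed_form


def check_47(k, max_mass, max_len):
    """Check (47) for all runs of length 2..max_len with masses in [k+1, max_mass]."""
    for n in range(2, max_len + 1):
        for masses in product(range(k + 1, max_mass + 1), repeat=n):
            m = merged_mass(masses, k)
            if not (m >= max(masses) + n - 1 > max(masses)):
                return False
    return True
```

For instance, three constituents of masses 3, 4, 5 merging at level 3 (so $k = 2$) give mass $12 - 2 \cdot 2 = 8$, which indeed exceeds $5 + 3 - 1 = 7$.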

So far we have shown that $\mathcal{C}_{k+1}$ is a partition of $\Gamma$ which satisfies (34) and (35) with $k$ replaced by $k+1$ (by the definition of $\mathcal{C}_k$ and induction on $k$). We next show by an indirect proof that this is also true for (36). It is convenient to first prove the following claim:

Claim. If $t \ge 1$, $C \in \bigcup_{j \ge 0} \mathcal{C}_{t+j}$ and $\ell(C) \le t$, then we have

$$C \in \mathcal{C}_{s,s+1} \quad\text{for } \ell(C) \le s \le (m(C)-1) \wedge t \tag{49}$$

(see definition (43)). To see this, define $\hat{s}$ as the smallest $s \ge \ell(C)$ for which $C \notin \mathcal{C}_{s,s+1}$, and assume that $\hat{s} \le (m(C)-1) \wedge t$. Then $m(C) \ge \hat{s}+1$, so that we must have $C \notin \mathcal{C}_{\hat{s}}$. But also $C \in \mathcal{C}_{\hat{s}-1,\hat{s}} \subset \mathcal{C}_{\hat{s}-1}$. (Note that $\hat{s} = \ell(C)$ cannot occur, because one always has $C \in \mathcal{C}_{\ell,\ell+1}$ for $\ell = \ell(C)$, by virtue of (46).) But then it must be the case that $C$ intersects $\mathrm{span}(r_i^{\hat{s}})$ for some $i$. In fact, by our construction, $C$ must then be a constituent of some cluster in $\mathcal{C}_{\hat{s},\hat{s}}$ corresponding to a maximal $\hat{s}$-run of length at least 2. But then $C$ does not appear in $\mathcal{C}_{\hat{s}+j}$ for any $j \ge 0$, and in particular $C \notin \bigcup_{j \ge 0} \mathcal{C}_{t+j}$, contrary to our assumption. Thus $\hat{s} \le (m(C)-1) \wedge t$ is impossible and our claim must hold.

We now turn to the proof of (36). This is obvious for $k = 0$ or $k = 1$. Assume then that (36) has been proven for some $k \ge 1$. Assume further, to derive a contradiction, that $C'$ and $C''$ are two distinct clusters in $\mathcal{C}_{k+1}$ such that $\min\{m(C'), m(C'')\} \ge r$ but $d(C', C'') < M^r$ for some $r \le k+1$. Without loss of generality we take $r = m(C') \wedge m(C'') \wedge (k+1)$. Let $C'$ and $C''$ have level $\ell'$ and $\ell''$, respectively. Since these clusters belong to $\mathcal{C}_{k+1}$ we must have $\max(\ell', \ell'') \le k+1$. For the sake of argument, let $\ell' \le \ell''$. If $\ell' = \ell'' = k+1$, then $d(C', C'') \ge M^{k+1}$, because, by construction, two distinct clusters of level $k+1$ have distance at least $M^{k+1}$. In this case we do not have $d(C', C'') < M^r$, so that we may assume $\ell' < k+1$.

Now first assume that $r - 1 \ge \max\{\ell', \ell''\} = \ell''$. Since $r - 1 \le k$ we then have by (49) (with $t = k$) that $C'$ and $C''$ both belong to $\mathcal{C}_{r-1,r}$. If the distance from $C'$ to the nearest cluster in $\mathcal{C}_{r-1,r}$ is less than $M^r$, then $C'$ will be a constituent of a cluster of level $r$ and $C'$ will not be an element of $\mathcal{C}_{k+1}$. Thus it must be the case that the distance from $C'$ to the nearest cluster in $\mathcal{C}_{r-1,r}$ is at least $M^r$. A fortiori, $d(C', C'') \ge M^r$. This contradicts our choice of $C', C''$.

The only case left to consider is when $r - 1 < \max\{\ell', \ell''\} = \ell''$. Since $r - 1 = (m(C')-1) \wedge (m(C'')-1) \wedge k \ge \ell' \wedge \ell'' \wedge k = \ell'$ (by (46); recall that $\ell' < k+1$ now), this means $\ell' \le r-1 < \ell''$. We still have, as in the last paragraph, that $C' \in \mathcal{C}_{r-1,r}$ and that the distance between $C'$ and the nearest cluster in $\mathcal{C}_{r-1,r}$ is at least $M^r$. By (35), $\mathrm{span}(C')$ and $\mathrm{span}(C'')$ have to be disjoint. For the sake of argument let us further assume that $C'$ lies to the left of $C''$, that is, $\omega(C') < \alpha(C'')$. We claim that $\alpha(C'') = \alpha(\widehat{C})$ for some cluster $\widehat{C} \in \mathcal{C}_{r-1,r}$. Indeed, the start-point of a cluster of level $\ell \ge 2$ equals the start-point of one of its constituents, which belongs to $\mathcal{C}_{\ell-1,\ell} \subset \mathcal{C}_{\ell-1}$. Repetition of this argument shows that $\alpha(C'')$ is also the start-point of a cluster $\widehat{C}$ which is a constituent of some cluster $\widetilde{C}$ such that $s := \ell(\widehat{C}) \le r-1$ but $t+1 := \ell(\widetilde{C}) \ge r$. In particular, $\widehat{C} \in \mathcal{C}_{t,t+1}$, so that $\widehat{C} \in \mathcal{C}_t$ and $m(\widehat{C}) \ge t+1 \ge r$. Thus $\ell(\widehat{C}) \le r-1 \le (m(\widehat{C})-1) \wedge t$. It then follows from (49) that $\widehat{C} \in \mathcal{C}_{r-1,r}$. As in the preceding case we then have

$$d(C', C'') \ge \alpha(C'') - \omega(C') = \alpha(\widehat{C}) - \omega(C') \tag{50}$$

$$\ge \text{the distance from } C' \text{ to the nearest cluster in } \mathcal{C}_{r-1,r} \tag{51}$$

$$\ge M^r. \tag{52}$$

Of course the inequality $d(C', C'') \ge M^r$ remains valid if $C'$ lies to the right of $C''$, so that we have arrived at a contradiction in all cases, and (36) with $k$ replaced by $k+1$ must hold. This completes the proof of (36).

Construction of $\mathcal{C}_\infty$. Observe that each $x \in \Gamma$ may belong to clusters of several levels, but not to different clusters of the same level (see (35)). If $C'$ and $C''$ are two clusters of levels $\ell'$ and $\ell''$, respectively, with $\ell' < \ell''$, then

$$\mathrm{span}(C') \cap \mathrm{span}(C'') \ne \emptyset \quad\text{implies}\quad \mathrm{span}(C') \subset \mathrm{span}(C''). \tag{53}$$

There will even have to be a sequence $C_0 = C', C_1, \dots, C_s, C_{s+1} = C''$ such that $C_i$ is a constituent of $C_{i+1}$, $0 \le i \le s$. This follows from the fact that each $\mathcal{C}_k$ is a partition of $\Gamma$ and that $\mathcal{C}_k$ is a refinement of $\mathcal{C}_{k+1}$. In fact, each element of $\mathcal{C}_{k+1}$ is obtained by combining several constituents which are consecutive elements of $\mathcal{C}_k$. (We allow here that an element of $\mathcal{C}_k$ is already an element of $\mathcal{C}_{k+1}$ by itself.) In turn, we see then from (48) that $m(C'') > m(C')$. In particular, no point belongs to two different clusters with the same mass. We shall use this fact in the proof of the next lemma.

We define the random index

$$\kappa(x) = \sup\{\ell : x \in C \text{ for some } C \in \mathcal{C}_{\ell,\ell}\}. \tag{54}$$

If we allow the value $\infty$ for $\kappa(x)$, then this index is always well defined, since each $x \in \Gamma$ belongs at least to the cluster $\{x\}$ of level 0.

Lemma 21. Assume that the sequence $\xi$ is distributed according to $P_p$. If $p > 0$ and $3 \le M < (64p)^{-1/2}$ we have a.s. $\kappa(x) < \infty$ for all $x \in \Gamma$.

Lemma 21 can be used for the construction of $\mathcal{C}_\infty$. It tells us that with $P_p$-probability one, for each $x \in \Gamma$, there exists a cluster of level $\kappa(x) \in \mathbb{Z}_+$ which contains $x$. This cluster is unique, since the elements of $\mathcal{C}_{k,k}$ are pairwise disjoint. We call it the maximal cluster of $x$ and denote it by $\mathcal{D}_x$. Moreover, for $x, x' \in \Gamma$, if $x' \in \mathcal{D}_x$ then $\kappa(x) = \kappa(x')$ and $\mathcal{D}_x = \mathcal{D}_{x'}$. Indeed, $\kappa(x) \ne \kappa(x')$ would contradict (53) and the definition of $\kappa$, while $\kappa(x) = \kappa(x')$ but $\mathcal{D}_x \ne \mathcal{D}_{x'}$ is impossible by (35).

Take $\tilde{x}_1 = x_1 \in \Gamma = \{x_j\}_{j \ge 1}$ and define $C_{\infty,1} = \mathcal{D}_{\tilde{x}_1}$. Having defined $C_{\infty,j} = \mathcal{D}_{\tilde{x}_j}$ for $j = 1, \dots, k$, we set $\tilde{x}_{k+1} = \min\{x_j \in \Gamma : x_j \notin \bigcup_{i=1}^{k} C_{\infty,i}\}$ and $C_{\infty,k+1} = \mathcal{D}_{\tilde{x}_{k+1}}$. Define $\mathcal{C}_\infty = \{C_{\infty,k}\}_{k \ge 1}$. Clearly, $\Gamma = \bigcup_{k \ge 1} C_{\infty,k}$. It is also routine to check that $\mathcal{C}_\infty$ satisfies (38) and (39). As for (40), this follows from (36) and the fact that $\mathcal{D}_x \in \mathcal{C}'_{k,k+1} \subset \mathcal{C}_{k+1}$ for all $k \ge \kappa(x)$ (by the definitions of $\kappa(x)$ and $\mathcal{C}'_{k,k+1}$).

The proof of Lemma 21 involves an exponentially small upper bound (in $k$) for the probability of having a cluster starting at a fixed point and having mass $k$.

Let $\Xi(p)$ denote the event of full probability in Lemma 21 where the above construction of $\mathcal{C}_\infty$ is well set. We then define

$$\chi(\xi) = \inf\{k \ge 0 : d(C, 0) \ge M^{m(C)} \text{ for all } C \in \mathcal{C}_\infty \text{ with } m(C) > k\} \tag{55}$$

with $\chi(\xi) = \infty$ if the above set is empty or $\xi \notin \Xi(p)$.

As a consequence of Lemma 21 one has

Proposition 22. Let $0 < p < \frac{1}{64M^2}$. Then

$$P_p(\xi : \chi(\xi) < \infty) = 1 \quad\text{and}\quad P_p(\chi(\xi) = 0) > 0.$$

Having in mind an application of the above grouping procedure to the proof of the main result, and using the notation introduced above, to each binary sequence $\xi \in \Xi(p)$ we associate the sequence $\psi^\xi \in \Psi$ defined as follows:

$$\psi^\xi_i = \begin{cases} m(C_{\infty,j}), & \text{if } i = \tilde{x}_j - \sum_{t < j} \mathrm{diam}(C_{\infty,t}), \\ 0, & \text{otherwise}, \end{cases} \tag{56}$$

where $\mathrm{diam}(C)$ denotes the diameter of $C$ (for the Euclidean distance). As an immediate corollary of Proposition 22 we have:

Corollary 23. If $0 < p < \frac{1}{64M^2}$ then $P_p\{\xi \in \Xi(p) : \psi^\xi \in \Psi_M\} > 0$.

Now we give an upper bound on the cardinality of a cluster in terms of its mass. For this we simply use the estimate on the cluster diameter obtained in the proof of Lemma 21.

Lemma 24. Let $0 < p < \frac{1}{64M^2}$ and $\xi \in \Xi(p)$. Then

$$\sup\{|C| : C \in \mathcal{C}_\infty,\ m(C) = k\} \le 3M^{k-1}$$

for all $k \in \mathbb{N}$.

Proof. See ([75]) and ^ in the appendix. □ 



4 Proof of Theorem 3

The following definition and lemma will play an important role in the proof of Theorem 3.

Definition 25. For $\psi, \zeta \in \Psi$ we say that $\psi \preceq \zeta$ if for any $j \ge 1$ the following holds:

$$\psi_j = 0 \iff \zeta_j = 0, \qquad \psi_j = k \implies k \le \zeta_j \le 3M^{k-1}. \tag{57}$$

Lemma 26. Let $p < \frac{1}{64M^2}$ and $\xi \in \Xi_\infty$ be such that $\chi(\xi) = 0$. Let $j_1(\xi) < 1 + j_2(\xi) < 2 + j_3(\xi) < \cdots$ denote the ordered elements of $\bigcup_{i \ge 1} (\mathrm{span}(C_{\infty,i}) \setminus C_{\infty,i})$. Set $\xi^{(0)} = \xi$ and for each $n \ge 1$

$$\xi^{(n)} := A^0_{j_n(\xi)}(\xi^{(n-1)}). \tag{58}$$

The limit $\hat\xi := \lim_{n \to \infty} \xi^{(n)}$ exists in $\Xi_\infty$ and

$$\psi^\xi \preceq f(\hat\xi) =: \hat\psi,$$

where $\psi^\xi$ is given by (56) and the function $f$ was defined in (6)-(8).
Proof. The proof follows at once from the previous construction and Lemma 

m □ 

We now construct the binary sequence $\eta$ that appears in Theorem 3. It is
obtained from $\zeta(L)$, see (19), by replacing each entry $(\zeta(L))_j = k$ by a string of
$3M^{k-1}$ consecutive ones, with $M = 3(L+1)$, and correspondingly shifting the
rest of the sequence to the right. That is, fix L > 2, and define C(-^) G ^ as 
follows: 

(C(L)), J^^^'^"^' if^'^b-dL'^+M., (59) 

for all j > 1, with M = 3(L + 1). Wc then let rj{L) be the unique element of 
Eoo such that ({L) = f{rj{L)), where / was defined by ©-([H]) in Section [2] 

We can now state and prove the following result. 

Theorem 27. Let $L \ge 2$. If $p < \frac{1}{576(L+1)^2}$ and the deterministic binary sequence $\eta(L)$ is defined as above, then

$$P_p\{\xi \in \Xi : (\eta(L), \xi) \text{ is compatible}\} > 0. \tag{60}$$

Proof Let L > 2 and M = 3(L + 1). If p < Corollary US says that 

Pp{^ e S(p): iIj^ £ *Af} > 0, and from CoroUary[^it follows that 

Pp{^ e E{p); (C(£),-0*) is compatible} > 0. 

On the other hand, using Lemma [26] it is simple to check that if the pair 
{({L),ip^) is compatible, then so is {({L),iIj). The proof then follows by re- 
calling Proposition [51 □ 

To conclude the proof of Theorem 3 it now remains to check that the set of zeroes $Z(L)$ in the sequence $\eta(L)$ of Theorem 27 has discrete Hausdorff dimension that tends to one as $L$ tends to infinity. As one verifies at once, its asymptotic density is zero.

We use the notion of discrete Hausdorff dimension, as introduced in [2], and related results from [1]. Among the simplest measures of the asymptotic size of a set $A \subset \mathbb{Z}$, let us recall the following:

Definition 28. The lower and upper mass dimensions of $A \subset \mathbb{Z}$ can be defined as

$$\dim_{LM}(A) = \liminf_{n \to \infty} \frac{\log |A \cap [-n/2, n/2)|}{\log n}, \tag{61}$$

$$\dim_{UM}(A) = \limsup_{n \to \infty} \frac{\log |A \cap [-n/2, n/2)|}{\log n}, \tag{62}$$

and if $\dim_{LM}(A) = \dim_{UM}(A)$, we call it the mass dimension of $A$.
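To make Definition 28 concrete, here is a minimal Python sketch that evaluates the ratio $\log|A \cap [-n/2, n/2)| / \log n$ at a single finite $n$. The helper names and the test set (the perfect squares, whose mass dimension is $1/2$) are illustrative assumptions and do not come from the text; a finite $n$ only approximates the limits in (61) and (62), and the set is assumed to meet the window.

```python
import math


def mass_dimension_estimate(contains, n):
    """Finite-n value of log|A cap [-n/2, n/2)| / log n for even n.

    `contains` is a membership test for the set A (hypothetical interface).
    """
    lo, hi = -(n // 2), n // 2
    count = sum(1 for x in range(lo, hi) if contains(x))
    return math.log(count) / math.log(n)


def is_square(x):
    """Membership test for the set of perfect squares {0, 1, 4, 9, ...}."""
    return x >= 0 and math.isqrt(x) ** 2 == x


estimate = mass_dimension_estimate(is_square, 10 ** 5)
```

The convergence is slow (logarithmic in $n$), so the finite-$n$ value is only roughly $1/2$.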
A simple computation shows that$^3$

$$\dim_{LM}(Z(L)) = \dim_{UM}(Z(L)) = \frac{\log L}{\log(3(L+1))}.$$



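To recall the definition of discrete Hausdorff dimension introduced in [2] we need some notation:

Let $I$ denote the set of all intervals $[x, y)$, $x, y \in \mathbb{Z}$. Given positive integers $r, n$, $r \ge 2$, we denote

$$I_1^{(r)} = [-r, r), \qquad I_n^{(r)} = [-r^n, r^n) \setminus [-r^{n-1}, r^{n-1}), \quad n \ge 2.$$

Given $\alpha > 0$ and $A, F \subset \mathbb{Z}$, set

$$\nu_\alpha(A, F) = \inf\Bigl\{\sum_i d(B_i)^\alpha : B_i \in I,\ A \cap F \subset \bigcup_i B_i\Bigr\}\, (d(F))^{-\alpha}, \tag{63}$$

and

$$m_\alpha(A) = \sum_{n=1}^{\infty} \nu_\alpha(A, I_n^{(r)}). \tag{64}$$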

Definition 29. The discrete Hausdorff dimension of a set $A$ is defined by

$$\dim_H(A) = \inf\{\alpha > 0 : m_\alpha(A) < \infty\}. \tag{65}$$

Another useful notion is

Definition 30. The upper entropy index of a set $A$ is defined by

$$\Delta(A) = \inf\Bigl\{\alpha > 0 : \max_{1 \le d \le r^{n(1-\varepsilon)}} (d r^{-n})^{\alpha} N(d, A \cap I_n^{(r)}) \to 0 \text{ for each } \varepsilon > 0\Bigr\}, \tag{66}$$

where $N(d, A)$ denotes the maximum number of disjoint intervals in $I$ of length $2d$ and with centers in $A$.

$^3$To match exactly the previous definition, we may take the two-sided version of the hierarchical sequence.

Remark. It is easy to see that

$$0 \le \dim_H(A) \le \dim_{UM}(A) \le \Delta(A) \le 1. \tag{67}$$

There is also a related notion introduced in [2], called discrete packing dimension, denoted by $\dim_P$, which uses the concept of packing measure suitably adapted to the discrete setup (see [2], Sect. 3, p. 130). This notion was used by Barlow and Taylor [2] to define fractal sets in $\mathbb{Z}$, as those for which there is equality of the Hausdorff and packing dimensions. On the other hand, the upper entropy index $\Delta$ always coincides with the packing dimension, as proven in [2] (Lemma 3.1, p. 131). Thus, one may equivalently say:

Definition 31. The set $A$ is called fractal if $\dim_H(A) = \Delta(A) = \dim_P(A)$.

To determine the Hausdorff dimension of $Z(L)$ we use the following result from [1] (a corollary of Theorem 4.1 in [5]).

Proposition 32. (Part of Corollary 2 of [1].) Let $A \subset \mathbb{Z}$. Suppose there are positive constants $c, c'$ such that for each large $n \in \mathbb{N}$ and all integers $x$,

$$|A \cap [-r^n, r^n)| \ge c' r^{n\alpha} \quad\text{and}\quad |A \cap [x - r^n, x + r^n)| \le c\, r^{n\alpha}. \tag{68}$$

Then $\dim_H(A) = \dim_{UM}(A)$.

To see that $Z(L)$ is a fractal we use the following statement from [1]:

Proposition 33. (Theorem 3 of [1].) Let $A \subset \mathbb{Z}$. Suppose there are positive constants $c, c'$ such that for all $x \in A$ and all $n \in \mathbb{N}$,

$$|A \cap [x - r^n, x + r^n)| \ge c' r^{n\alpha} \quad\text{and}\quad |A \cap [-r^n, r^n)| \le c\, r^{n\alpha}. \tag{69}$$

Then $\Delta(A) = \dim_{UM}(A)$.

Conclusion of the proof of Theorem 3. Straightforward computations show that both conditions (68) and (69) are satisfied for $Z(L)$. In particular, it is a fractal of Hausdorff dimension $\frac{\log L}{\log(3(L+1))}$. This concludes the proof. □
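The claim that this dimension tends to one as $L$ grows can be illustrated numerically. The short Python sketch below simply evaluates the formula $\log L / \log(3(L+1))$ for a few values of $L$; the function name `z_dimension` is hypothetical, and the code checks nothing about the set $Z(L)$ itself.

```python
import math


def z_dimension(L):
    """Value of log L / log(3(L+1)), the dimension stated for Z(L)."""
    return math.log(L) / math.log(3 * (L + 1))


values = [z_dimension(L) for L in (2, 10, 100, 10 ** 4, 10 ** 8)]
```

The values increase with $L$ and stay below 1, but the approach to 1 is slow, since $1 - \log L/\log(3(L+1))$ is roughly $\log 3 / \log L$ for large $L$.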

5 Appendix. Proof of Lemma 21

Proof of Lemma 21. $\kappa(x) = +\infty$ can occur only if there exists an infinite increasing subsequence of indices $\{k_i\}_{i \ge 1}$ such that the point $x$ becomes "incorporated" into some cluster of level $k_i$ for all $i \ge 1$. We will show that

$$P_p(x \text{ belongs to an infinite sequence of clusters}) = 0. \tag{70}$$

Notice that each cluster of level $k$ necessarily has mass at least $k+1$ and no point belongs to two different clusters of the same mass, as already observed. Setting

$$A_k(x) = [x \text{ belongs to a cluster of mass } k], \tag{71}$$

we shall show that for each fixed $x$

$$P_p(A_k(x) \text{ i.o. in } k) = 0, \tag{72}$$

which will prove (70).

We will carry out the proof in two steps. All constants $c_i$ below are strictly positive and independent of $k$. First we estimate the probability that a given point $z \in \mathbb{Z}_+$ is the start-point of a cluster of mass $k \ge 2$. Specifically, we show that

$$P_p\bigl(\exists\, C \in \textstyle\bigcup_{\ell \ge 1} \mathcal{C}_\ell : \alpha(C) = z,\ m(C) = k\bigr) \le c_1 e^{-c_2 k} \tag{73}$$

for some strictly positive constants $c_1, c_2 > 0$ and for each fixed $k$. In fact we can take $c_2 > \log L$ so that

$$c_1 (L^k + 1) e^{-c_2 k} \le 2 c_1 e^{-c_3 k} \tag{74}$$

for some constant $c_3 > 0$. This is the most involved part of the proof. In the second step of the proof we show that if $C \in \bigcup_{\ell \ge 1} \mathcal{C}_\ell$ and $m(C) = k$, then

$$\mathrm{diam}(C) \le 3L^{k-1}. \tag{75}$$

Due to (75) we will have the following inclusion:

$$A_k(x) \subset \bigl[\exists\, z \in [x - L^k, x] : \alpha(C) = z \text{ for some } C \in \textstyle\bigcup_{\ell \ge 1} \mathcal{C}_\ell \text{ and } m(C) = k\bigr]. \tag{76}$$

From (73), (74) and (76) we will have

$$P_p(A_k(x)) \le c_1 (L^k + 1) e^{-c_2 k} \le 2 c_1 e^{-c_3 k}, \tag{77}$$

which, by the Borel-Cantelli lemma, gives (72), and so (70).
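The arithmetic behind (74) and (77) can be checked numerically. The Python sketch below is an illustration only: it plugs in the constants that the proof of (73) arrives at further on, namely $c_1 = 32/((1-4pL^2)(1-64pL^2))$ and $c_2 = -\log(2p) - \log L$, and assumes the natural choice $c_3 = c_2 - \log L$, which the text does not spell out. The function names are hypothetical.

```python
import math


def constants(p, L):
    """Constants c1, c2 from the proof of (73), and the assumed c3 = c2 - log L."""
    assert 64 * p * L * L < 1, "the proof requires 64 p L^2 < 1"
    c1 = 32 / ((1 - 4 * p * L * L) * (1 - 64 * p * L * L))
    c2 = -math.log(2 * p) - math.log(L)
    c3 = c2 - math.log(L)
    return c1, c2, c3


def bounds_77(p, L, k):
    """The middle and right-hand expressions in (77)."""
    c1, c2, c3 = constants(p, L)
    middle = c1 * (L ** k + 1) * math.exp(-c2 * k)
    right = 2 * c1 * math.exp(-c3 * k)
    return middle, right
```

Since $L^k + 1 \le 2L^k$ and $c_3 = -\log(2pL^2) > 0$ whenever $64pL^2 < 1$, the right-hand side is a convergent geometric series in $k$, which is what the Borel-Cantelli lemma needs.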

Let us now prove (73), where $k \ge 2$ and $z \in \mathbb{Z}_+$. To any given cluster $C \in \bigcup_{\ell \ge 1} \mathcal{C}_\ell$ we associate a "genealogical weighted tree". It describes the successive merging processes which lead to the creation of $C$, i.e., it tells the levels at which some clusters form runs, merging into larger clusters, and how many constituents entered each run, down to level 1, and finally the masses of such level 1 clusters. So we represent it as a tree with the root corresponding to $C$; the leaves correspond to clusters of level 1, which are the basic constituents. This weighted tree gives the basic information on the cluster, neglecting what was incorporated as "dust" on the way.

More formally, we construct the tree iteratively. The root of the tree corresponds to the cluster $C$. If this cluster is of level 1, the procedure is stopped. For notational consistency such a tree will be called a 1-leaf tree. To the root we attribute the index 1, as well as another index which equals the mass of the cluster.

If the resulting cluster $C$ is of level $\ell > 1$, we attribute to the root the index $\ell$ and add to the graph $n_1$ edges (children) going out from the root, where $n_1 \ge 2$ is the number of constituents which form the $\ell$-run leading to $C$. Each endvertex of a newly added edge will correspond to a constituent of the run, i.e., if $C$ has constituents $C_{\ell-1,i_1}, \dots, C_{\ell-1,i_{n_1}} \in \mathcal{C}_{\ell-1,\ell}$, for suitable $i_1, \dots, i_{n_1}$, then there is a vertex at the end of an edge going out from the root corresponding to $C_{\ell-1,i_j}$ for each $j = 1, \dots, n_1$. If the constituent corresponding to a given endvertex is a level 1 cluster, the procedure at this endvertex is stopped (producing a leaf on the tree), and to this leaf we attribute an index, which equals the mass of the corresponding constituent.

If a given endvertex corresponds to a cluster $C'$ of level $\ell'$ with $1 < \ell' < \ell$, then to this endvertex we attribute the index $\ell'$, and add to the graph $n_2$ new edges going out of this endvertex, where $n_2$ is the number of constituents of $C'$ in $\mathcal{C}_{\ell'-1,\ell'}$ which make up $C'$.

The procedure continues until we reach the state that all constituents corresponding to newly added edges are level 1 clusters. In this way we obtain a tree with the following properties:

i) each vertex of the tree has either 0 or at least two offspring; in case of 0 offspring we say that the vertex is a leaf of the tree. Otherwise we call it a branch node.

ii) to each branch node $x$ we attribute an index $\ell_x$; these indices are strictly decreasing to 1 along any self-avoiding path from the root to a leaf of the tree.

iii) to each leaf is associated a mass $m > 1$.

This defines a map

$$\gamma : C \in \bigcup_{\ell \ge 1} \mathcal{C}_\ell \mapsto \gamma(C) = (T(C), \bar\ell(C), \bar m(C)),$$

where $T(C)$ is a finite tree with $\mathcal{L}(T(C))$ leaves and $\mathcal{N}(T(C))$ branching nodes. We use the following notation:

$\bar\ell(C) = \{\ell_1(C), \dots, \ell_{\mathcal{N}(T(C))}(C)\}$ is a multi-index with one component for each branching node of $T(C)$, which indicates the level at which branches "merge" into the cluster corresponding to the node;

$\bar m(C) = \{m_1(C), \dots, m_{\mathcal{L}(T(C))}(C)\}$ is a multi-index with one component for each leaf of $T(C)$, which gives the mass of the cluster corresponding to the leaf;

$\bar n(C) = \{n_1(C), \dots, n_{\mathcal{N}(T(C))}(C)\}$ is a multi-index with one component for each branching node of $T(C)$, which gives the degree of the vertex minus 1. Note that $\bar n(C)$ is determined by $T(C)$.

To lighten the notation, we will omit the argument $C$ in situations where confusion is unlikely. Thus we occasionally write $\gamma(C) = (T, \bar\ell, \bar m)$ instead of $(T(C), \bar\ell(C), \bar m(C))$.

In order to prove (73) we decompose the event

$$\bigl[\exists\, C \in \textstyle\bigcup_{\ell \ge 1} \mathcal{C}_\ell : \alpha(C) = z,\ m(C) = k\bigr] \tag{78}$$

according to the possible values for $\gamma(C)$; we shall abbreviate the number of leaves of $T(C)$ by $\mathcal{L}$ and the number of branching nodes by $\mathcal{N}$. Since the resulting cluster $C$, obtained after all merging processes "along the tree", has mass $k$, it imposes the following relation between the multi-indices $\bar m$ and $\bar\ell$:

$$\sum_{i=1}^{\mathcal{L}} m_i - \sum_{j=1}^{\mathcal{N}} (n_j - 1)(\ell_j - 1) = k. \tag{79}$$

Here the first sum runs over all leaves, while the second sum runs over all branching nodes. This relation follows from (44) by induction on the number of vertices, by writing the tree as the "union" of the root and the subtrees which remain after removing the root. We note that $T$ also has to satisfy

$$\sum_{j=1}^{\mathcal{N}} (n_j - 1) = \mathcal{L} - 1, \tag{80}$$

because it is a tree, as one easily sees by induction on the number of leaves. This implies the further restriction

$$\sum_{i=1}^{\mathcal{L}} m_i \ge k + \mathcal{L} - 1,$$

because $\ell_j \ge 2$ in each term of the second sum in (79) (recall that we stop our tree construction at each node corresponding to a cluster of level 1). Thus the probability of the event in (78) equals

$$\sum_{r \ge 1}\ \sum_{T : \mathcal{L}(T) = r}\ \sum_{\bar\ell, \bar m} P_p\bigl(\exists\, C \in \textstyle\bigcup_{\ell \ge 1} \mathcal{C}_\ell : z = \alpha(C),\ m(C) = k,\ \gamma(C) = (T, \bar\ell, \bar m)\bigr), \tag{81}$$

where the third sum $\sum_{\bar\ell, \bar m}$ is taken over all possible values of $\bar\ell, \bar m$ satisfying (79).
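Relations (79) and (80), and the restriction on $\sum_i m_i$ that follows from them, can be verified on a concrete tree. In the Python sketch below a genealogical tree is encoded, purely for illustration, as a nested structure: a leaf is an integer (the mass of a level 1 cluster) and a branching node is a pair `(level, children)`. This encoding and the function name `analyse` are assumptions of the sketch, not notation from the text; masses of inner nodes are computed with rule (44).

```python
def analyse(tree):
    """Return (mass, leaf_masses, branch_nodes) for a genealogical tree.

    A leaf is an int (mass of a level 1 cluster); a branching node is a pair
    (level, children). branch_nodes lists (level, number_of_children) pairs.
    """
    if isinstance(tree, int):
        return tree, [tree], []
    level, children = tree
    parts = [analyse(child) for child in children]
    n = len(children)
    # Rule (44) with k = level - 1: sum of constituent masses minus k(n - 1).
    mass = sum(part[0] for part in parts) - (level - 1) * (n - 1)
    leaves = [m for part in parts for m in part[1]]
    nodes = [(level, n)] + [node for part in parts for node in part[2]]
    return mass, leaves, nodes


# A level 3 cluster built from two level 2 clusters and one level 1 cluster.
example = (3, [(2, [2, 3]), 4, (2, [2, 2, 2])])
mass, leaves, nodes = analyse(example)

lhs_79 = sum(leaves) - sum((n - 1) * (lvl - 1) for lvl, n in nodes)
lhs_80 = sum(n - 1 for _, n in nodes)
```

In this example the two level 2 nodes have masses $5 - 1 = 4$ and $6 - 2 = 4$, so the root has mass $12 - 2 \cdot 2 = 8$; the leaf masses sum to 15 and $\sum_j (n_j - 1)(\ell_j - 1) = 7$, in agreement with (79).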

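A decomposition according to the value of the sum $\sum_i m_i$ shows that the expression (81) equals

$$\sum_{r \ge 1}\ \sum_{T : \mathcal{L}(T) = r}\ \sum_{s \ge r-1}\ \sum_{\bar m : \sum_i m_i = k+s}\ \sum_{\bar\ell} P_p\bigl(\exists\, C \in \textstyle\bigcup_{\ell \ge 1} \mathcal{C}_\ell : \alpha(C) = z,\ m(C) = k,\ \gamma(C) = (T, \bar\ell, \bar m)\bigr), \tag{82}$$

the sum $\sum_{\bar\ell}$ being taken over possible choices of $\bar\ell$ such that $\sum_j (n_j - 1)(\ell_j - 1) = s$. The multiple sum in (82) can be bounded from above by

$$\sum_{r \ge 1}\ \sum_{T : \mathcal{L}(T) = r}\ \sum_{s \ge r-1}\ \sum_{\bar m : \sum_i m_i = k+s}\ \sum_{\bar\ell} p^{k+s} L^{k+2s}. \tag{83}$$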
Indeed, for fixed $z$, $k$ and $(T, \bar\ell, \bar m)$, the probability

$$P_p\bigl(\exists\, C : \alpha(C) = z,\ m(C) = k,\ \gamma(C) = (T, \bar\ell, \bar m)\bigr)$$

is easily estimated by the following argument: the probability to find a level 1 cluster of mass $m_i$ which corresponds to some leaf of the tree, and which starts at a given point $x$, is bounded from above by $p^{m_i} L^{m_i - 1}$. Indeed, such a cluster has to come from a maximal level 1 run $x_s, x_{s+1}, \dots, x_{s+m_i-1}$ of elements of $\Gamma$, with $x_s = x$ and $x_{j+1} - x_j < L$ for $j = s, \dots, s + m_i - 2$. The number of choices for such a run is at most $L^{m_i - 1}$, and given the $x_j$, the probability that they all lie in $\Gamma$ is $p^{m_i}$. Similarly, the probability to find two level 1 clusters of mass $m_{i_1}$ and $m_{i_2}$ which merge at level $\ell_j$ can be bounded above by $p^{m_{i_1}} L^{m_{i_1} - 1} p^{m_{i_2}} L^{m_{i_2} - 1} L^{\ell_j}$. The factor $L^{\ell_j}$ here is an upper bound for the number of choices for the distance between the two clusters; if they are to merge at level $\ell_j$, their distance can be at most $L^{\ell_j}$. Iterating this argument we get that

$$P_p\bigl(\exists\, C \in \textstyle\bigcup_{\ell \ge 1} \mathcal{C}_\ell : \alpha(C) = z,\ m(C) = k,\ \gamma(C) = (T, \bar\ell, \bar m)\bigr) \le \prod_i p^{m_i} L^{m_i - 1} \prod_j L^{\ell_j (n_j - 1)},$$

and taking into account that

$$\sum_i (m_i - 1) + \sum_j (n_j - 1)\ell_j = \sum_i (m_i - 1) + \sum_j (n_j - 1)(\ell_j - 1) + \sum_j (n_j - 1),$$

as well as (80), we get the bound (83). Explicitly, with $\sum_i m_i = k + s$, $\sum_j (n_j - 1)(\ell_j - 1) = s$ and $r$ leaves, the exponent of $L$ equals $(k + s - r) + s + (r - 1) = k + 2s - 1 \le k + 2s$.

The number of terms in the sums of (83) over $\bar m$ and $\bar\ell$ are respectively bounded by $2^{k+s}$ and $2^{s}$ (since $\sum_j (\ell_j - 1) \le \sum_j (n_j - 1)(\ell_j - 1) = s$ and $\ell_j \ge 2$). Thus we can bound (83) from above by

$$\sum_{r \ge 1}\ \sum_{T : \mathcal{L}(T) = r}\ \sum_{s \ge r-1} 2^{k+s}\, 2^{s}\, p^{k+s} L^{k+2s} = \frac{(2pL)^k}{1 - 4pL^2} \sum_{r \ge 1}\ \sum_{T : \mathcal{L}(T) = r} (4pL^2)^{r-1},$$

provided we take $4pL^2 < 1$. Now the number of planted plane trees of $u$ vertices is at most $4^u$ (see [6]). Our trees have $r$ leaves, but all vertices which are not leaves have degree at least 3 (except, possibly, the root). Thus, by virtue of (80), these trees have at most $2r$ vertices. The number of possibilities for $T$ in the last sum is therefore at most $\sum_{u=r+1}^{2r} 4^u \le \frac{4}{3} 4^{2r} < 2 \cdot 4^{2r}$. It follows that (83) is further bounded by

$$\frac{2 (2pL)^k}{1 - 4pL^2} \sum_{r \ge 1} 4^{2r} (4pL^2)^{r-1} = \frac{32 (2pL)^k}{1 - 4pL^2} \sum_{r \ge 1} (64pL^2)^{r-1}.$$

If we take $64pL^2 < 1$, this can be bounded by

$$\frac{32 (2pL)^k}{(1 - 4pL^2)(1 - 64pL^2)},$$
which proves (73) and (74) with $c_2 = -\log(2p) - \log L > \log L$ for our choice of $p, L$. It remains to show (75). It is trivially correct for $k = 1$; in fact a cluster of mass 1 has to be a singleton by (17). We will use induction on $k$. Assume (75) holds for all clusters with mass at most $k - 1$, where $k \ge 2$. Let $C$ be a cluster with $m(C) = k$ and of level $\ell$. Thus $C \in \mathcal{C}_{\ell,\ell}$, and $1 \le \ell \le k - 1$, by virtue of (46). If $\ell = 1$ then $\mathrm{diam}(C) \le (k-1)L \le 3L^{k-1}$ for $k \ge 2$, provided we take $L \ge 2$. If $\ell \ge 2$, then there exist $n \ge 2$ and $C_{\ell-1,i_1}, \dots, C_{\ell-1,i_n} \in \mathcal{C}_{\ell-1,\ell}$ such that $C$ is made up from the constituents $C_{\ell-1,i_1}, \dots, C_{\ell-1,i_n}$ (where, for simplicity, we have omitted the indication of the level of the constituents). If $m_j = m(C_{\ell-1,i_j})$, then $m_j \ge \ell$ (by (43)), and from (47) we see that $m_j \le k - n + 1$ for each $j$. From this and the induction hypothesis we get

$$\mathrm{diam}(C) \le \sum_{j=1}^{n} \mathrm{diam}(C_{\ell-1,i_j}) + (n-1)L^{\ell} \le 3nL^{k-n} + (n-1)L^{k-n+1} \le 3L^{k-1} \tag{85}$$

for all $L \ge 3$ and $n \ge 2$. This proves (75) and the lemma. □
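The last inequality in (85) is equivalent, after dividing by $L^{k-n}$, to $3n + (n-1)L \le 3L^{n-1}$. The following Python sketch (the function name is hypothetical) checks it with exact integer arithmetic on a range of parameters, and also shows that it fails for $L = 2$, which is why $L \ge 3$ is required at this step.

```python
def step_85_holds(L, n, k):
    """Check 3 n L^(k-n) + (n-1) L^(k-n+1) <= 3 L^(k-1) for integers k > n >= 2."""
    lhs = 3 * n * L ** (k - n) + (n - 1) * L ** (k - n + 1)
    return lhs <= 3 * L ** (k - 1)


all_ok = all(
    step_85_holds(L, n, k)
    for L in range(3, 11)
    for n in range(2, 9)
    for k in range(n + 1, n + 7)
)
```

For $n = 2$ the reduced inequality reads $6 + L \le 3L$, i.e. $L \ge 3$, so the case $L = 3$, $n = 2$ is an equality.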
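Proof of Proposition 22. From (73) and (74) we have

$$P_p\bigl(\exists\, C_{\infty,i} : m(C_{\infty,i}) > k,\ d(C_{\infty,i}, 0) < L^{m(C_{\infty,i})}\bigr) \le \sum_{m > k} c_1 L^m e^{-c_2 m}, \tag{86}$$

which tends to zero as $k \to \infty$, proving that $\chi(\cdot) < \infty$ a.s.

Now if $\xi \in \Xi(p)$ is such that $\chi(\xi)$ is finite and non-zero, then there exists a unique cluster $C^* \in \mathcal{C}_\infty(\xi)$ such that $m(C^*) = \chi(\xi)$ and $d(C^*, 0) < L^{m(C^*)}$. The existence of $C^*$ follows at once from the definition of $\chi$. For the uniqueness we observe that if two such clusters, say $C'$ and $C''$, would exist, then they would have to satisfy $d(C', C'') < L^{\min\{m(C'), m(C'')\}}$, which contradicts (40) by virtue of the assumption $C', C'' \in \mathcal{C}_\infty$.

We now construct a new environment $\bar\xi$ (depending on $\xi$): when $\chi(\xi) = 0$ we let $\bar\xi_i = \xi_i$ for all $i \ge 1$. On the other hand, if $0 < \chi(\xi) < \infty$ we set

$$\bar\xi_i = \begin{cases} 0, & \text{if } i \le \omega(C^*), \\ \xi_i, & \text{if } i > \omega(C^*). \end{cases} \tag{87}$$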

We shall now show that

$$\chi(\bar\xi) = 0. \tag{88}$$

Of course we only have to check this in the case $0 < \chi(\xi) < \infty$. We claim that $\mathcal{C}_\infty(\bar\xi)$ is also well defined and all clusters in $\mathcal{C}_\infty(\bar\xi)$ are also clusters in $\mathcal{C}_\infty(\xi)$ (which are located in $[\omega(C^*) + 1, \infty)$) and the masses of such a cluster in the two environments $\xi$ and $\bar\xi$ are the same. To see this we simply run through the construction of the clusters in $\bigcup_{\ell \ge 1} \mathcal{C}_\ell$ in the environment $\bar\xi$, until there arises a difference between this construction and the construction in the environment $\xi$. More precisely, we apply induction with respect to the level of the clusters. Clearly any cluster of level 0 in $\bar\xi$ is simply a single point of $\Gamma(\bar\xi)$ which lies in $[\omega(C^*) + 1, \infty)$, and has mass 1. This is also a cluster of level 0 and mass 1 in $\xi$. Assume now that we already know that any cluster in $\bar\xi$ of level at most $k$ is also a cluster of $\xi$ of level at most $k$, located in $[\omega(C^*) + 1, \infty)$ and with the same mass in $\xi$ and $\bar\xi$.

Since $\bar\xi_i = 0$ for $i \le \omega(C^*)$, the span of any $(k+1)$-run in $\bar\xi$ has to be contained in $[\omega(C^*) + 1, \infty)$. Therefore the span of any cluster of level $k+1$ in environment $\bar\xi$ also has to be contained in $[\omega(C^*) + 1, \infty)$. In addition, since the two environments $\xi$ and $\bar\xi$ agree in this interval, a difference in the constructions or masses of some cluster of level $k+1$ can arise only because in $\xi$ there is a $(k+1)$-run which contains clusters of level at most $k$ which lie in $[\omega(C^*) + 1, \infty)$ as well as clusters which intersect $[0, \omega(C^*)]$. But then these clusters of level at most $k$ will be constituents of a single $(k+1)$-cluster of $\xi$, $C$ say. Thus $\mathrm{span}(C)$ has to contain points of both $[0, \omega(C^*)]$ and of $[\omega(C^*) + 1, \infty)$ in $\xi$. Consequently, $\mathrm{span}(C)$ has to contain both points $\omega(C^*)$ and $\omega(C^*) + 1$. Since $\omega(C^*) \in C^*$ we then have from (53) that $\mathrm{span}(C^*) \subset \mathrm{span}(C)$ and $C^* \ne C$ (because $\omega(C^*) + 1 \notin \mathrm{span}(C^*)$). But no such $C$ can exist, because $C^* \in \mathcal{C}_\infty$.

This establishes our last claim. Now, by definition of $\chi$, (88) is equivalent to

$$d(C, 0) \ge L^{m(C)} \tag{89}$$

for all clusters $C$ in $\mathcal{C}_\infty(\bar\xi)$. In view of our claim this will be implied by (89) for all clusters $C$ in $\mathcal{C}_\infty(\xi)$ located in $[\omega(C^*), \infty)$. Now, if $C$ is such a cluster with $m(C) \le m(C^*)$, then (89) holds, because, by virtue of (40),

$$d(C, 0) = \alpha(C) \ge \alpha(C) - \omega(C^*) = d(C, C^*) \ge L^{m(C)}.$$

On the other hand, if $m(C) > m(C^*) = \chi(\xi)$, then the definition of $\chi$ shows that we have $\alpha(C) \ge L^{m(C)}$. This proves (89) in all cases, and therefore also proves (88).
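We now have

$$1 = P_p(\chi(\xi) < \infty) \le P_p(\chi(\xi) = 0) + \sum_{n=0}^{\infty} P_p\bigl(\omega(C^*) = n,\ \chi(\xi^{(n)}) = 0\bigr),$$

where $\omega(C^*)$ is as described above, and $\xi^{(n)}$ is given by

$$\xi^{(n)}_i = \begin{cases} 0, & \text{if } i \le n, \\ \xi_i, & \text{if } i > n. \end{cases}$$

Thus, either $P_p(\chi(\xi) = 0) > 0$ or there is some non-random $n \in \mathbb{Z}_+$ for which $P_p(\chi(\xi^{(n)}) = 0) > 0$. However,

$$\begin{aligned} P_p(\chi(\xi) = 0) &\ge P_p(\chi(\xi) = 0,\ \xi_i = 0 \text{ for } 0 \le i \le n) \\ &= P_p(\chi(\xi^{(n)}) = 0,\ \xi_i = 0 \text{ for } 0 \le i \le n) \\ &= P_p(\chi(\xi^{(n)}) = 0)\, P_p(\xi_i = 0 \text{ for } 0 \le i \le n) \end{aligned}$$

(because $\chi(\xi^{(n)})$ is determined by the $\xi_i$ with $i > n$). Thus $P_p(\chi(\xi) = 0) > 0$ in all cases, concluding the proof. □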